ACAGGTGGCG TTCTGACTGG TCGAGATGCC TTTGAACACA TCCTCTGTGG AGCAAGTATG 7260

GTGCAGGTGG GAACGACCCT TCACAAAGAA GGCGTCAGTG CTTTTGACCG CATTACCAAT 7320 

GAACTGAAAG CAATCATGGT GGAAAAAGGC TACGAGAGCT TAGAAGATTT CCGTGGGAAA 7380 

TTCCGCTATA TTGACTAAAT TAAATCGAAA AATCTGAAGA AAGGAGAGAC GATGCTAGCC 7440 

ATTGAAGAAA GTCAGAAGTT GACTTTATCA AATTTACCGA GCCTGAGCCT ATTTACAGGG 7500 

ACAGATCAGG GTCAGTTTCA AGTGATGAAG AGTCAAATGT TGAAACAGAT TGGGTATGAT 7560 

TCTGCTGACC TCAACTTTGC CTACTTTGAT ATGAAAGAAG TACTTTACAA GGATGTGGAA 7620 

CTGGAGTTGG TCAGCCTTCC TTTCTTTGCG GATGAGAAAA TCGTGATATT AGATTATTTT 7680 

ATGGATATCA CGACTGCTAA GAAACGCTTT TTGACAGATG ATGACCTTAA GTCATTTGAG 7740 

GAATACCTTG ACAATCCTTC TCCAACAACC AAGTTGATAA TCTTTGCAGA AGGAAAGCTG 7800 

GATAGCAAAA GACGGTTAGT CAAATTACTT AAGCGTGATG CCAAGGCCTT CGATGCAGTA 7860 

GAAGTAAAAG AACAAGAATT GCGCCAGTAC TTCCAAAAGT GGAGTCAGAA ACAAGGTCTG 7920 

CAGTTTACCA ATCATTCTTT TGAAAATCTC CTCATCAAGT CGGGGTTTCA ATTTAGCGAA 7980 

ATCCAGAAAA ATCTTCTCTT TTTACAGTCC TATAAGGCGA ATTCTGTTAT TGAGGAAGAG 8040 

GATATTGTTA ACGCAATTCC CAAGACTTGC AGGACAATAT TTTTGATTTA ACTCAGTTTA 8100 

TTCTGACTAA AAAGATGGAT GAGGCGCGCG ATTTGGTGAG ACACTTGACC TTGCAAGGGG 8160 

AAGATGAAAT CAAACTGATT GCAGTCATGC TGGGACAATT TCGGACTTTT ACTCAGGTGA 8220 

AGATTTTGGC GGAGTCTGGC CAAACAGAAT CGCAGATTGC AAGTAGTTTA GGTAGTTATC 8280 

TGGGACGTAA CCCAAATCCT TATCAAATCA AGTTTGCATT AAGAGATTCG AGAGGACTTT 8340 

CTTTGACCTT TTTGAAGCAA GCTATTTCCT ATTTGATTGA GACAGACTAT CAGATTAAGA 8400 

CAGCTCTTTA TGAAAAAGGT TTCCTTTTTG AAAAGGCACT CTTACAGATT GCTAGTCAGG 8460 

TCAATTGACA TTTGTTGAAA CTACTAACCC GCGG 8494
(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9707 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
CCGGTCAGTT CGTTCAGTAC AAGGAATCAT AATGAACGAT CAATCAGAAA AAAAGACTAG 60
AAAGAAGACT GTATGGATAA TCGACCAATT GGTTTTTTGG ATTCGGGTGT CGGGGGCTTG 120

ACCGTTGTGC GCGAGCTCAT GCGCCAGCTT CCCCATGAAG AAATCGTCTA TATTGGAGAT 180

TCGGCGCGGG CGCCCTATGG CCCCCGTCCT GCTGAGCAAA TTCGTGAATA TACTTGGCAG 240 

CTGGTCAACT TTCTCTTGAC CAAGGATGTC AAAATGATTG TCATTGCTTG TAACACTGCG 300 

ACTGCGGTCG TCTGGGAAGA AATCAAGGCT CAACTAGATA TTCCTGTCTT GGGTGTAATT 360 

TTGCCAGGAG CTTCGGCAGC CATCAAGTCC AGTCAAGGTG GGAAAATCGG AGTGATTGGA 420 

ACGCCCATGA CGGTACAATC AGACATATAC CGTCAGAAAA TCCATGATCT GGATCCCGAC 480 

TTACAGGTGG AGAGCTTGGC CTGTCCCAAG TTTGCTCCCT TGGTTGAGTC AGGTGCCCTG 540 

TCAACCAGTG TTACCAAGAA GGTGGTCTAT GAAACCCTGC GTCCCTTGGT TGGAAAGGTG 600 

GATAGCCTGA TTTTGGGCTG TACTCATTAT CCACTCCTTC GCCCTATTAT CCAAAATGTG 660 

ATGGGGCCAA AGGTTCAGCT CATCGATAGT GGGGCAGAGT GCGTACGGGA TATCTCAGTC 720 

TTACTCAATT ATTTTGAAAT CAATCGTGGT CGCGATGCTG GACCACTCCA TCACCGTTTT 780 

TACACAACAG CCAGTAGCCA AAGTTTTGCA CAAATTGGTG AAGAATGGCT GGAAAAAGAG 840 

ATTCATGTGG AGCATGTAGA ATTATGACAA ATAAAATTTA TGAATATAAG GATGACCAGG 900 

ACTGGTATGT TGGGTCTTAT AGTATTTTTG GTGGCGTTAA CAGTTTGAGC GACTATAAGA 960 

CAGATTTTCC TCTGTTTGAA TTCTCCAAAA TATTTGGAGA TGAAGAGTAT GGTTTCCCGC 1020 

TTTCAGTTAC TGTTTTACGC TATGGTTCTA TCTACCGTTT GTTCTCCTTT GTGGTAGACA 1080 

TGCTTAATCA AGAAATGGGA CGAAACTTGG AAGTTATTCA ACGTCATGGG GCCCTGCTCT 1140 

TGGTTGAAAA TGGGCAACTC TTGTATGTAG AATTGCCTAA AGAAGGGGTC AATGTTCATG 1200 

ATTTCTTTGA GACAAGCAAG GTCAGAGAAA CCTTGTTGAT TGCGACTCGT AACGAAGGTA 1260 

AAACCAAGGA ATTCCGAGCT ATCTTTGATA AGTTAGGCTA CGATGTGGAA AATCTTAATG 1320 

ACTACCCTGA CCTGCCTGAA GTAGCAGAAA CAGGTATGAC CTTTGAAGAA AATGCCCGCC 1380 

TTAAGGCAGA AACCATTTCT CAATTAACGG GCAAGATGGT TTTGGCAGAT GATTCTGGTC 1440 

TCAAAGTCGA TGTCCTTGCT GGCTTACCAG GCGTCTGGTC AGCTCGTTTC GCAGGTGTGG 1500 

GAGCAACTGA CCGTGAAAAT AATGCCAAAC TCTTGCACGA ATTGGCCATG GTCTTTGAAC 1560 

TCAAGGACCG CTCGGCTCAG TTCCACACAA CCCTAGTCGT AGCCAGCCCA AATAAGGAAA 1620 

GTTTAGTTGT TGAAGCAGAC TGGTCAGGTT ATATTAACTT TGAACCTAAG GGTGAAAATG 1680 

GCTTTGGCTA TGATCCCCTC TTCCTTGTAG GAGAAACAGG TGAGTCATCA GCTGAATTAA 1740 

CCCTGGAAGA AAAAAATAGT CAATCTCACC GTGCCTTAGC CGTTAAGAAA CTTTTGGAGG 1800 

TATTTCCATC ATGGCAAAGC AAACCATCAT TGTAATGAGC GATTCCCATG GCGATAGCTT 1860 
GATTGTGGAA GAAGTCCGTG ATCGCTATGT GGGCAAAGTC GATGCTGTTT TTCATAACGG 1920

CGATTCTGAA CTACGTCCGG ATTCTCCACT TTGGGAGGGC ATCCGCGTTG TTAAAGGGAA 1980

CATGGACTTC TACGCCGGCT ACCCAGAACG TCTGGTGACT GAGCTTGGTT CGACCAAGAT 2040

TATCCAAACT CATGGTCACT TGTTTGACAT CAATTTCAAC TTTCAAAAGT TGGACTACTG 2100

GGCTCAGGAG GAAGAGGCCG CTATCTGCCT CTATGGTCAC TTGCATGTGC CAAGTGCTTG 2160

CTTGGAAGGC AAGATCCTCT TTCTAAATCC AGGTTCTATC AGTCAACCAC GAGGTACCAT 2220

CAGAGAATGT CTCTATGCTC GTGTGGAGAT TGATGATAGT TACTTCAAAG TGGACTTTTT 2280

GACACGAGAT CACGAGGTGT ATCCAGGTTT GTCCAAGGAG TTTAGCCGAT GATTGCCAAG 2340

GAGTTTGAGA CTTTCTTGTT GGGGCAGGAG GAAACTTTTT TGACCCCTGC TAAAAATCTA 2400

GCTGTGTTGA TTGATACCCA CAATGCGGAT CATGCGACCC TCTTGCTCAG TCAGATGACC 2460

TATACCCGTG TTCCCGTTGT GACAGATGAA AAACAGTTTG TTGGGACGAT TGGACTCAGA 2520

GATATTATGG CTTATCAGAT GGAGCATGAC TTGAGCCAAG AAATCATGGC GGATACGGAT 2580

ATCGTTCATA TGACAAAAAC GGACGTAGCG GTTGTTTCGC CTGATTTCAC CATTACGGAG 2640

GTCTTGCACA AGCTAGTAGA TGAGTCCTTC TTACCGGTTG TGGATGCAGA GGGTATTTTC 2700

CAAGGGATTA TTACGCGCAA GTCCATCCTC AAGGCCGTTA ATGCCCTCTT GCATGACTTT 2760

AGTAAGGAAT ATGAGATTCG ATGCCAATGA GAGACAGGAT TTCAGCCTTT TTAGAGGAAA 2820

AGCAGGGCTT GTCTGTCAAT TCCAAGCAGT CCTATAAGTA TGATTTGGAG CAATTTTTAG 2880

ACATGGTAGG TGAGCGGATT TCTGAGACCA GTCTCAAGAT TTACCAAGCC CAGCTAGCCA 2940

ATCTAAAAAT CAGCGCCCAG AAGCGAAAGA TTTCGGCCTG TAACCAATTT CTATACTTTC 3000

TCTATCAAAA AGGAGAGGTG GACAGCTTTT ACCGCTTGGA ATTAGCCAAA CAAGCTGAAA 3060

AGAAGACGGA AAAGCCAGAG ATTCTATACC TAGACTCTTT TTGGCAGGAA AGCGACCATC 3120

CAGAGGGCCG CTTGCTAGCG CTCTTAATCC TAGAAATGGG GCTCTTGCCC AGTGAGATTT 3180

TAGCCATCAA GGTTGCGGAC ATCAATCTGG ATTTTCAGGT GTTGCGAATC AGCAAGGCTT 3240

CCCAACAGAG GATTGTCACC ATTCCCACGG CCTTGCTTTC AGAATTGGAA CCCTTGATGG 3300

GGCAGACCTA TCTTTTTGAA AGAGGAGAGA AACCCTATTC TCGTCAGTGG GCCTTTCGTC 3360

AGTTAGAATC TTTTGTCAAG GAGAAAGGTT TTCCATCCTT ATCAGCTCAA GTCTTACGTG 3420

AACAGTTTAT TCTAAGACAA ATAGAAAACA AGGTCGATTT GTACGAAATT GCAAAAAAAT 3480

TAGGATTAAA AACAGTCCTG ACCTTAGAAA AATATAGATA ATGGATATTA AATTAAAAGA 3540

TTTTGAAGGA CCCCTGGACT TGCTCTTGCA TCTGGTTTCT AAGTACCAGA TGGATATCTA 3600
CGATGTGCCC ATTACGGAAG TCATCGAACA GTATCTAGCC TATGTCTCAA CCCTGCAGGC 3660

CATGCGTCTG GAAGTGACGG GTGAGTACAT GGTCATGGCT AGTCAGCTCA TGCTGATTAA 3720

GAGTCGTAAA CTCCTTCCGA AGGTAGCAGA AGTGACAGAC TTGGGGGATG ACCTGGAGCA 3780

GGACCTCCTC TCTCAAATCG AAGAATATCG CAAGTTCAAG CTCTTGGGTG AGCACTTGGA 3840

AGCCAAGCAC CAAGAACGGG CCCAGTATTA TTCCAAAGCG CCGACAGAGT TGATTTACGA 3900

AGATGCGGAG CTTGTGCATG ACAAGACGAC CATTGACCTC TTTTTGACTT TTTCAAATAT 3960

CCTAGCCAAG AAAAAAGAGG AGTTTGCACA AAATCACACG ACGATCTTGC GGGATGAGTA 4020

TAAGATTGAG GACATGATGA TTATCGTGAA AGAGTCCTTG ATTGGACGAG ATCAATTGCG 4080

CTTGCAGGAT TTGTTCAAGG AAGCCCAGAA TGTCCAAGAG GTCATCACCC TCTTTTTGGC 4140

AACCCTAGAG TTAATCAAAA CCCAGGAGTT GATCCTCGTG CAAGAGGAGA GTTTTGGAGA 4200

TATCTATCTC ATGGAAAAGA AGGAAGAAAG TCAAGTGCCT CAAAGCTAGA CTTGATAGAG 4260

AGGAAAGATG AGTACTTTAG CAAAAATAGA AGCGCTCTTG TTTGTAGCGG GTGAAGATGG 4320

GATTCGGGTC CGCCAGTTAG CTGAACTCCT CTCTCTGCCA CCGACAGGCA TCCAGCAAAG 4380

TTTAGGAAAA TTAGCCCAGA AGTATGAAAA GGACCCAGAT TCCAGTTTGG CTTTGATTGA 4440

GACAAGTGGT GCTTATAGAT TGGTGACCAA GCCTCAATTT GCAGAGATTT TGAAGGAATA 4500

CTCTAAGGCG CCTATCAACC AGAGCTTGTC TCGGGCTGCC CTTGAGACCT TGTCCATTAT 4560

TGCCTACAAA CAGCCGATTA CGCGGATAGA AATTGATGCC ATCCGTGGAG TTAACTCGAG 4620

TGGAGCCTTG GCAAAGTTGC AGGCTTTTGA CCTGATAAAG GAAGACGGGA AAAAGGAAGT 4680

ATTGGGGCGC CCCAACCTCT ATGTGACTAC GGATTATTTC CTAGATTACA TGGGGATAAA 4740

CCATTTAGAA GAATTACCAG TGATTGATGA GCTTGAGATT CAAGCCCAAG AAAGCCAATT 4800

ATTTGGTGAA AGGATAGAAG AAGATGAGAA TCAATAAGTA TATTGCCCAC GCAGGTGTGG 4860

CCAGTAGGAG AAAAGCAGAA GAGCTGATTA AGCAAGGCTT GGTGACGGTT AACGGCCAAG 4920

TGGTGCGTGA ACTAGCAACC ACTATCAAGT CAGGCGACAA GGTCGAAGTT GAAGGTCAAC 4980

CTATCTACAA CGAAGAAAAG GTCTACTATC TGCTTAACAA ACCACGCGGT GTGATTTCCA 5040

GTGTGACAGA TGATAAGGGT CGCAAGACGG TTGTCGACCT CTTGCCCAAT GTCAAAGAGC 5100

GTATTTACCC TGTGGGTCGT TTGGACTGGG ATACATCAGG TGTCTTGATT TTGACCAATG 5160

ATGGGGACTT TACAGACGAG ATGATTCACC CTCGTAATGA GATTGACAAG GTTTATGTCG 5220

CGCGTGTTAA AGGTGTGGCC AATAAGGACA ATCTCCGCCC CTTGACCCGT GGTCTTGAGA 5280

TTGATGGTAA GAAAACCAAG CCAGCTGTTT ATGAAATTCT CAAAGTGGAC CCAGTCAAAA 5340

ATCGCTCTGT GGTGCAGTTG ACCATCCATG AAGGGCGTAA CCATCAGGTT AAAAAGATGT 5400
TTGAAGCTGT TGGTCTCCAA GTAGATAAGT TGTCTCGGAC TCGTTTCGGA CACCTAGACT 5460

TGACAGGACT CCGTCCAGGA GAATCCCGTC GTCTTAATAA AAAAGAAATC AGCCAACTAC 5520 

ACACCATGGC TGTAACTAAG AAATAATGAA ACGAATTTTA ATAGCGCCTG TGCGCTTTTA 5580 

CCAACGTTTT ATCTCACCAG TCTTTCCACC CTCTTGTCGC TTTGAGCTGA CTTGTTCCAA 5640 

CTACATGATT CAGGCTATTG AAAAACATGG GTTTAAGGGG GTATTGATGG GCTTGGCTCG 5700 

GATTTTACGT TGTCATCCCT GGTCGAAAAC AGGTAAGGAC CCCGTTCCAG ACCGCTTTTC 5760 

CCTTAAACGA AATCAAGAAG GCGAATGACG TGGGGTAAAT AGATTTCAAA ATGATAAAAA 5820

CGCATCCTAT CAGGTTTGAG TGAACTTGAT AGGATGCGTT TTAGAATGTC AAAATTTTAT 5880 

ACTCTTCGAA AATCTCTTCA AACCGCGTCA GCTTTCATCT GCAACCTCAA AACAGTGTTT 5940 

TGAGCAACCT GCGGCTAGTT TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTAAATTGA 6000 

GTTTGAAGTG GCTTATTTCA AAGCTTTTTG TATGTCTTCA ATCATGAGTT TTGTTGATTC 6060 

AAGTCCGCCT CCGCTTAGAT ACCAGAGGTC TGGTGTTAGT TGGATAATCT TACCATTTTT 6120 

AGCAGCAGGT GTTTCAGCGA TAAGGGCATT TTCTAGGACA CCGTCGTTGC TAGAGTTGTC 6180 

CCCACCGATG GCAAGGGTAC GGTTGATGAC AAAGAGGATG TCAGGGTTGA TTTCTTTGAC 6240 

ACTTTCAAAG CTGACTTCTT GTCCGTGGCG TGAGTCTTCA AATTTTGTAT CAGTTGGTTT 6300 

GAATTTCAAG GTTTGGTACA AGAAAGAGAA ACGAGATTTG GCACCAAAGG CTGCCATTTT 6360 

TCCTTCATTA AGGAGGATCG CAAGGGCTTT TTTGTCAGAG CTTTCATTTT TAGTAGCGAC 6420 

TTCTTGGATG CTCTTGTCTA GCTTGGTCAA TTCTTCCTTG GCTTTCTGTG TACCAGTTTC 6480 

GCCGAAGGCA CTTGCTAAGG ATTCGATATT AGCCTTGGTA GAAGTCCAGT AGTCGTCCTT 6540 

GCTTGCTTGG AAGAGAACGG TTGGGGCGAT TTCTTTGAAT TTGTCTACGA ATTTTTGTGT 6600 

ACGTGGCGAA GCGATAATCA AATCAGGCTC AAGGGCGGCG ATAGCTTCTA AATCAGGTTC 6660 

TTTCATAGAA CCAACATTTT TGACAGTTCC CACTAGGTCT TTTAGATAAG TCGGAACAGT 6720 

TTTTGTAGGC ATTCCGACGA TATTTTTTTC AAATCCTAAA GCGCGAATAG TATCCGCAGC 6780 

GCCGAGGTCA AAGGTCACAA TCTTTTCAGG AACTTTGGAA AGTTTGACCT CGTCCAGTGA 6840 

ACTTTTAATG GTTACCTCTG TTGGAGCAGA GCTACTGGTC TCTGTCTGAC TAGTGCTTGA 6900 

GTTTGTACTA CATGCACCAA GTAGGAGCAA GAAGCTGGCC ACTAGGGCAG TGAAATAAAG 6960 

TTTAAGGGAT GTTTTCATAA TTTCTCCTTT TTAAAATGTG ATAACGATTT AGGGAGTCTC 7020 

TTAATCTTAT TGACTAAGAG ACTGAACGTT CTCTAACTTG AGCTTTTATG TTACTAGCTA 7080 

TAGATACAGA TCTTTTTGTC ATTGATATCA GCTAGCGTGA TGGGAATCTC ATAAAGTTGA 7140 
CTGAGCAGGT CAGCCTGCAT GATTTGATCG CTTCTTCCCT TGCTAAAGAC CTGGCCCTCC 7200

TTGAAGGCGA CAATTTCATC TGCATACTGA CTGGCCATGT TGATATCGTG GAGGACGATG 7260 

ATAATGGTCT TGCCGAGTTC CTCCACCAGT CGTCGAAGAA TCTGCATCAT GCTGACGCTT 7320 

TGCTTGATAT CGAGATTGTT GAGTGGTTCG TCCAGCAAGA TAAAGTCCCT ATCCTGGGCC 7380 

AGTACCATAG CGATAAAGAC GCGCTGGAGT TGCCCCCCTG ACAGGCTATT GATGTAGCGG 7440 

TCTTTTAAGT TGCTCAGTTC TAAATAGTTC AGAGTTTCTC GGATTTTTTC CCAGTCTTCT 7500 

GATCTAAGTC GACCTCGGCT GTAGGGAAAA CGTCCAAAAC TGACCAGTTC TTCAACAGTC 7560 

AATTTGGCTT GGTAATTGAT TTTCTGTTTT AGGATGGTTA GTTCTTGGGC CAGTTCTTGC 7620 

GAATTCCAGC TCTCGATTTC ACGTCCTTTG ATACTGAGAA CTCCCTGATC TTTCTTGGTT 7680 

AGCCTGCTCA TGATGGAGAG GAGAGTCGAT TTTCCAGCAC CATTTGGACC AATAAAGGCT 7740 

GTCAGTTTTT GAGGACTGAC TTCAAGCGAA ATGCCTTGCA AAATATCCTG TTTTTGAATG 7800 

GATTTGTCAA TGTTTTCCAG TTTCACTGAC GAGACCTCCT ATATAGTAAG ATAAAGAATA 7860 

AGAAGCCACC CACACTCTCA ATGATCATAC TGATACGAAT TTCCAGTGCA AAGACTCGTT 7920 

CAATCAAGGC TTGCCCCAAG GTTAAGCTAA TAAATCCAAC CAGAATGGCC ACTATAAAGA 7980 

GTAACTTGTG CTGATAGTCT TTGACAATCA GGTAGGTGAG GTTGGCCAGT ATAAAGCCGA 8040 

AGAAGGCCAT AGGTCCTACC AAGGCAGTGG CCGTTGAGGT CAAAAGCACG ATTCCCCAGA 8100 

GGAGCTCTTT CTGTTCTTTT TCAACATCGA GTCCCAATAT CTGAGCCGTT TCTCTTTGCA 8160

GGTGCAAGAC ATCTAGAACG ACTGCTTTTC GAAAGAAAAA GATTGTCAAA GCGAGGATGA 8220 

TCAGAGAACC GATGGCTAGG ATGGAAGTGT TGAGATGTTG AAAGGAGGCA AAAAGACTAT 8280 

TTTGCAGTTT ATCGTATTCG TTTGGATCCA TTAGGACTTG AAGGAAGGTG CTGATATTTC 8340 

GAAAGAGACT TCTGAGCGCT AGACAGATCA GCAGGACGAA GACCAGGTCT TGCTTCATCA 8400 

GTGTCTTCAA GTAACCTTGT AAGGCGAGAA AGAAGAGGGA CTGGACAAGA AGTAAGACTA 8460 

GGAATTCTAA GATAGGGGAT TTGCCAAGTT GAAGAAACTT GCTTTCAAAA ACCAGTAG1A 8520 

GGGTTTGTAG TAGGACGTAG AAGGATTCAA TTCCCAAAAT ACTAGGCGTC AGGAAGCGAT 8580

TTTCCGTCAG GGTTTGAAAA CTAATGGTCG AAATCCCAGT CGCGATGGCT ACCAAGAGAT 8640 

AAACGATGAT CTTTTGGGAA CGCAACTTCC AAGCAAAGGC TGACAAGTGA GTGATGGGCC 8700 

AAAAGTAGAG AAGACAAGCT CCGATGGCAA GAATAATGAG AATCCAGAAG AGCTTGGTAT 8760 

GTTTGCTTTT AGTCTGCATC TTTTCGTCCC CCTCTCCAGA GAAGTAGGAT AAAGACGAGA 8820 

CTACCGATGA TTCCTAGCAA GAGACTGACA GACAACTCAT AGGGCCTAAT CAGAACTCGG 8880 

GATAGGATAT CGCAAGCCAG AACTAGATTG GCACCAACCA GTGCGACCAT GAGTTTGGTT 8940 
TGACTTAGAT TATCTCCATA GCGCTTGCGA ACAAGATTGG GAACGATAAC TCCGAGAAAT 9000

GGTAGGCCAC CCACGGTAAT CATCGTGACG CTTGTCGTTA GCGCCACCAG AAAGAGGGCC 9060 

AGTTTTTCAA GTAGGGAGTA GGAAATCCCC AAACTCTCGC TGGTTTCTTT CCCTAGATTC 9120 

ATGATGGTGA AGGTTTGGGA TAATTTCCAA ACGGTTATCA GGATGATGAG GCCTAAGAAG 9180 

AGCCACTCAT ACTGATGGGT CTGAATCATG GAGAAGGAGC CCTGGGTCCA GGCAGTCATA 9240 

CTCTGAACCA GATTGAAACG ATAGGCGATA ACTTCTGTGA CTGAGCCGAT AATCCCGCTA 9300 

TAGATGATCC CAATCAGAGG CAACATCCAC CTTTCCTTTA CAGTAAAAAT GGTCATAAAG 9360 

GCTAGGAAGA AGAGGGTGAA TACGATGGAT GAAACAAAAG CGAAGACCAT CTTGTGGGTC 9420 

AGACTAGCCG ATGGAAAGAC AAAAAGGCTC AGCACCATTC CCAGTTTGGC GGCTTCAGTC 9480 

GTTCCAACTG TACTCGGTGC AGCAAACTGA TTTTGGGTAA TAGTCTGCAT GAGAAGGCCT 9540 

GCCATACTCA TACTAGAGGC AGTCAGGAGA ATACTGATAG TTCTTGGGAG ACGGGACTCT 9600 

TGAAAGAGGA GCCAGGTCTG CTGGTCGAAA TCAAATAGCT TTCCCCATGA AAAATCACTG 9660 

GTCCCAATGC TAATAGAGAG AAAGACTAGG AGTAGAAGTA AGCCAGG 9707 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5910 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:
CCGCAATTAT GCTTGAAAAG GAGTATACTT ATAAGTAACG CAAACGTTTG CGTCTGAAAA 60 

ATACGCAACG TTCCATTATT TTAACACACG AGGTGCTATT ATGAAAAAAC GTCAAAGTGG 120 

TGTGTTGATG CACATCTCTT CTCTTCCAGG AGCTTACGGA ATCGGATCAT TTGGTCAAAG 180 

TGCTTACGAC TTCGTTGATT TCTTGGTCCG TACAAAACAA CGTTACTGGC AAATCCTTCC 240 

ATTAGGACCA ACTAGTTACG GGGATTCTCC TTACCAATCT TTCTCAGCCT TCGCAGGAAA 300 

CACTCATTTT ATCGATTTAG ATATCTTGGT GGAGCAAGGT TTGTTGGAAG CAAGTGACCT 360 

TGAAGGAGTT GACTTTGGTA GCGATGCGTC TGAAGTTGAC TATGCTAAAA TCTACTATGC 420 

ACGTCCTCCT CTTTTAGAAA AAGCGGTGAA ACGTTTCTTT GAAGTCGGAG ATGTTAAAGA 480 

TTTTGAGAAA TTTGCTCAAG ACAACCAATC ATGGCTTGAG CTCTTTGCTG AGTATATGGC 540 

TATCAAAGAG TATTTTGACA ATCTTGCTTG GACTGAATGG CCAGATGCAG ATGCTCGTGC 600 
TCGTAAAGCT TCAGCACTTG AAAGCTATCG TGAGCAATTG GCAGACAAGT TGGTTTACCA 660

CCGTGTGACT CAATACTTCT TCTTCCAACA ATGGTTGAAA TTGAAAGCTT ACGCTAACGA 720 

CAACCACATC GAAATCGTTG GGGACATGCC AATCTACGTA GCGGAAGATT CAAGTGATAT 780 

GTGGGCAAAT CCACATCTCT TCAAAACAGA TGTCAATGGT AAGGCTACTT GTATCGCAGG 840 

ATGCCCACCA GATGAGTTTT CTGTAACTGG TCAGCTTTGG GGTAATCCAA TCTATGACTG 900 

GGAAGCAATG GACAAAGACG GCTACAAATG GTGGATTGAA CGCTTGCGTG AAAGCTTCAA 960 

AATCTACGAT ATCGTTCGTA TCGACCACTT CCGTGGCTTC GAATCTTACT GGGAAATCCC 1020 

TCCTGGTTCC GATACAGCAG CACCTGGTGA GTGGGTGAAA GGTCCAGGTT ACAAGCTTTT 1080 

TGCAGCCGTT AAGGAAGAAC TTGGTGAGCT AAACATCATC GCAGAAGACC TTGGCTTCAT 1140 

GACAGATGAA GTGATCGAAT TGCGTGAACG TACTGGCTTC CCAGGAATGA AGATTCTTCA 1200 

ATTTGCCTTC AACCCAGAAG ACGAAAGCAT TGATAGCCCA CACTTGGCAC CTGCTAACTC 1260 

AGTTATGTAC ACAGGAACAC ACGATAACAA TACGGTTCTT GGTTGGTACC GTAATGAGAT 1320 

TGATGATGCG ACTCGTGAGT ACATGGCTCG TTACACGAAC CGTAAAGAAT ACGAAACAGT 1380 

GGTACACGCT ATGCTTCGTA CAGTATTTTC ATCAGTTAGC TTTATGGCAA TTGCAACTAT 1440 

GCAAGATTTA CTAGAATTGG ATGAGGCAGC TCGTATGAAC TTCCCATCTA CCCTTGGTGG 1500 

AAACTGGTCT TGGCGTATGA CTGAAGATCA ATTGACACCA GCTGTCGAGG AAGGTTTGCT 1560 

TGACTTGACA ACAATTTATC GCCGAATTAA TGAAAATTTG GTAGATTTAA AGAAATAAGA 1620 

CAATAATCAG GAGACAACTA AACATGTTAT CACTACAAGA ATTTGTACAA AATCGTTACA 1680 

ATAAAACCAT TGCAGAATGT AGCAATGAAG AGCTTTACCT TGCTCTTCTT AACTACAGCA 1740 

AGCTTGCAAG CAGCCAAAAA CCAGTCAACA CTGGTAAGAA AAAAGTTTAC TACATCTCAG 1800 

CTGAGTTCTT GATTGGTAAA CTCTTGTCAA ACAACTTGAT TAACCTTGGT CTTTACGACG 1860 

ATGTTAAAAA AGAACTTGCA GCTGCAGGTA AAGACTTGAT CGAAGTTGAA GAAGTTGAAT 1920 

TGGAACCATC TCTTGGTAAT GGTGGTTTGG GACGTTTGGC TGCCTGCTTT ATCGACTCAA 1980 

TTGCTACTCT TGGTTTGAAT GGTGACGGTG TTGGTCTTAA CTACCACTTT GGTCTTTTCC 2040 

AACAAGTTCT TAAAAACAAC CAACAAGAAA CAATTCCAAA TGCATGGTTG ACAGAGCAAA 2100 

ACTGGTTGGT TCGCTCAAGC CGTAGCTACC AAGTACCATT TGCAGACTTT ACTTTGACAT 2160

CAACTCTTTA CGATATTGAT GTTACTGGTT ATGAAACAGC GACTAAAAAC CGCTTGCGTT 2220 

TGTTTGACTT GGATTCAGTT GATTCTTCTA TTATTAAAGA TGGTATCAAC TTTGACAAGA 2280 

CAGATATCGC TCGCAACTTA ACTCTCTTCC TTTACCCAGA TGATAGTGAC CGTCAAGGTG 2340 

AATTGCTCCG TATCTTCCAA CAATACTTCA TGGTTTCAAA CGGTGCGCAA TTGATCATCG 2400 
ACGAAGCAAT CGAAAAAGGA AGCAACTTGC ATGACCTTGC TGACTACGCA GTTGTCCAAA 2460

TCAACGATAC TCACCCATCA ATGGTGATTC CTGAATTGAT TCGTCTTTTG ACTGCACGTG 2520

GTATCGATCT TGACGAAGCA ATCTCAATTG TTCGTAGCAT GACTGCCTAC ACTAACCACA 2580

CAATCCTTGC TGAAGCGCTT GAAAAATGGC CTCTTGAATT CTTGCAAGAA GTGGTTCCTC 2640

ACTTGGTACC AATCATCGAA GAATTGGACC GTCGTGTGAA GGCAGAGTAC AAAGATCCAG 2700

CTGTTCAAAT CATCGATGAG AGCGGACGTG TTCACATGGC TCACATGGAT ATCCACTACG 2760

GATACAGTCT TAACGGGCTT GCAGCACTCC ATACTGAAAT CTTGAAAAAT TCTGAGTTGA 2820

AAGCCTTCTA CGACCTTTAC CCAGAAAAGT TCAACAACAA AACAAACGGT ATCACTTTCC 2880

GTCGTTGGCT TATGCATGCT AACCCAAGAT TGTCTCACTA CTTGGATGAG ATTCTTGGAG 2940

ATGGTTGGCA CCATGAAGCA GATGAGCTTG AAAAACTTTT GTCTTATGAA GACAAAGCAG 3000

TTGTCAAAGA AAAATTGGAA AGCATCAAGG CTCACAACAA ACGTAAATTG GCTCGTCACT 3060

TGAAAGAACA CCAAGGTGTG GAAATCAATC CAAATTCTAT CTTTGATATC CAAATCAAAC 3120

GTCTTCACGA GTACAAACGC CAACAAATGA ACGCTTTGTA CGTGATCCAC AAATACCTTG 3180

ACATCAAAGC TGGTAACATC CCTGCTCGTC CAATCACAAT CTTCTTTGGT GGTAAAGCAG 3240

CTCCAGCCTA CACAATCGCT CAAGACATTA TCCATTTAAT CCTTTGCATG TCAGAAGTTA 3300

TTGCTAACGA TCCAGCAGTA GCTCCACACT TGCAAGTAGT TATGGTTGAA AACTACAACG 3360

TTACTGCAGC AAGTTTCCTT ATCCCAGCAT GTGATATCTC AGAACAAATC TCACTTGCTT 3420

CTAAAGAAGC TTCAGGTACT GGTAACATGA AATTCATGTT GAACGGAGCT TTGACACTTG 3480

CTACTATGGA CGGTGCTAAC GTGGAAATCG CTGAGTTGGT TGGAGAAGAA AACATCTACA 3540

TCTTCGGTGA AGATTCAGAA ACTGTTATCG ACCTTTACGC AAAAGCAGCT TACAAATCAA 3600

GCGAATTCTA CGCTCGTGAA GCTATCAAAC CATTGGTTGA CTTCATCGTT AGTGATGCAG 3660

TTCTTGCAGC TGGAAACAAA GAGCGCTTGG AACGTTTTTA CAATGAATTG ATCAACAAAG 3720

ACTGGTTCAT GACTCTTCTT GATTTGGAAG ACTACATCAA AGTCAAAGAG CAAATGCTTG 3780

CTGACTACGA AGACCGTGAC GCATGGTTGG ATAAAGTCAT CGTTAACATT TCTAAAGCAG 3840

GATTCTTCTC ATCTGACCGT ACAATOGCTC AGTATAACGA AGACATCTGG CACTTGAACT 3900

AATACTCTTC GAAAATCTCT TGAAACCACG TCAGCTTTAT CTGCAACCTC AAAGCAGTGC 3960

TTTGAGCAAC TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT TTTCATTGAG TATAAGATAC 4020

AAATTTATAC TAATACATTT TGTAAAAAAG CGAGTTTCGA TTGAAATTCG CTTTTTTAAT 4080

GATGTAGATT TGGGTCAATC TTGTCTAAAA ATAGGGAAAT CCTAGATACA GTGAAGGCTT 4140
TAAATGCTGG TTTTTACTCT CCTCAGCCTT ATATTTTTTC GTAGTTGGTT ACCTCATATC 4200

TATTATATTC GCTTACATAA AGTATTATAA TATAATTGTA CGAAAGAAGG TGTTTTTATG 4260 

ATATACACAC TTAAATTGGT GTTGTTTATT ACCTTTCTTG TAATAAGCTT GTTACCTCAT 4320 

AAGATTTTTG CAAAAAATAA AAAAATTTGG AAAATAGTTT TTGCAATATT GACGGCAGTG 4380 

GCAGCATTGT CATTTATGTA CTAAGTTATT TTAAGAATGT AGGGAAATAA ACCCTACATT 4440 

CTTTTTAGTT TTTTCTGTTT TCTAAATTCT ATTTATCCAA GCGATTCAAC ATTTCTTGCT 4500

TCTTCGCTTC AAGTTCTGCA CGCTTTTCTT CGATTTCGGC ATGTTTTTTC TCGAGTTCAG 4560 

AACAACTTGC ACCATTGCTA AATTCTTTTC GCCATCAGGA GATAGGGTCA GTCGACATGT 4620 

CTATTACTCA CCCAAAGCAG TCCTACAAAG CAGGAATTTT CTGTTACTTT TTTGGAAATA 4680 

GTAACGTTTA TACAGCTTTG ACACTTCGTA TCAAAGCGCC AAACACACTC CGAGGGGTTT 4740

ACAGAAAGCA GAAAAGGAAT GATCTGGTAT AAGATCATTC CTTTTCyCTC TTTTTCTTTA 4800 

AGTAATTATA TACAATGTAC GACGAAGTCG TCATTGCAAT GCTGATCCAC CACCTAAAGG 4860 

GAACTTTAAA CAACATTGAT AAGATAAAGA ATATAAACAA CGAAAATACG TTATACCCAA 4920

TTAATTTTAT TGTATATCTC ATGATTAAAA GTTAATCCTT CCGTTGTTAG GAATGGCATC 4980 

ATTTTTATCC CATAATTGTG CTAAATAAGT CCCCGGTGAT AATAAATTCA TAGCGAATTC 5040 

TAAAGCAACA TCATTTACAA ACCAACTACC TAGATATCTA GAAATTGCTG AACGAATAGC 5100 

ACTTTTTGCT GCATGTTTTC CTTTTACTTT AATTAGATTT GCAAGGCCTG CAGTAGTTCC 5160 

TCCTAATGCT AAAGCTATTG CAGTATCTAA TAGAGCACCC ATTTGATTAA CTGTAATACC 5220 

TTGCCAAACT GCTCTAAATG GAGAGTATGT AGCTGGGATT GTATAATCGC CTTGTAATTG 5280 

TCGGTTAATT ACTTCTTTGA TCCATTGTTG TGAGACGTCT GCATGAAAAG ATTGGATTTC 5340 

GTTTGCAAGT GTATTGATTT GTTCTTCTGT TAGAGAAGTG ACAGGTTGAA GTTCCATATT 5400 

TGTTTCAATT TGTGATACTT GTTCAGAAGC GTATACAGCT GAAACACTTG GAATCGCTGA 5460 

TACAATTAAC ACAATTGACG TCAAAAAAAC CGAAATAAAT TTCATTAATT TGTTCATGAG 5520 

CTTTTCTCCT TTTTATTTGC ATCTGCTTAC ATTTTATCAT ATACTGTTAT TATAGTCAAA 5580 

AAAATATGCT ATTATGTTAA AAAAATATTT TTCAAAATAT AAATGGACGG ATTTATTTTG 5640 

GATTTTATTT GTTATTTTGA CCTGCCTCTA TATTGGTAAC CATGATTTGT TTACTCTCAA 5700 

TCATCAAGAA TTCTCTTTTC GTGGTAGCGT TTCGGGTCTG GTACTGGCCT TATATCACTT 5760 

ACTATTCATT GATAAGTTTG TTATATCGAA TCGAAAATAA AGATTAGAGC TATGCTTGAC 5820 

TGTGTACTTT TAGGATTTAT TTTGGAGGAA GATTTTGTCT CTATTATTTA TTATTTTAAA 5880 

TTTATTTATT TTGTATAAGA TCTATTCTTT 5910



WO 98/18931 



PCT/US97/19588 



1067 

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5406 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:
GGCATAGCGA CTCATTTTTT CAACTGTCCA GGCTGGATAC CAGACTAATT TAACCTCAGT 60

ATCCGTTACT TCTGGAACCT CTATCATAGC ATCATAAATC TGGTCTGTCA AAAGGTCTGC 120

TAAGGGACAA CCCATAGTTG TCAAAGTCAT GTCAATCTCT GTTTGCCCTG TGTCACCGTC 180

AAAACGAATC TCATAGATCA AACCAAGATT GACAATATCG ATTCCCAACT CAGGGTCGAT 240

GACTTCTTCC AAGGCTGTTA AAATCCGTGT TTTGATGTTT TCAATTTGCT CTTCTGTATA 300

AGCCATATTT TCCTCACTCT TAGTCTTCAA TAAAATCACG AAGCGGTTTG CTACGACTTG 360

GTTGGCGTAG TTTTCTCAAA GCCTTTGCTT CAATCTGACG GATACGCTCA CGAGTTACGT 420

TAAAGACTTT CCCCACATCT TCAAGTGTGC GCATTTTTCC ATCATCTAGT CCAAAACGTA 480

GACGCAGAAC ATTTTCTTCA CGGTCTGTAA GAGTATCTAA GATTTCATCC AATTGCTCAC 540

GCAAGACGAT ACGAGTCCTA TAATCCACTG GATTTTCAAT CACTTCATCT TCGATAAAGT 600

CTCCAAGGTG GCTATCGTCC TCTTCACCGA TAGGAGTTTC AAGAGATACT GGTTCTTGGG 660

CAATCTTCAA GATTTCACGA ACCTTATCAG GTGTCATATC CATTCGTTCA GCAATCTGTT 720

CTGGTGTCGG ATCTTGCCCC AATTCTTGAA GGAGATTCCG CTGTTCACGA ACCAATTTAT 780

TGATAGTTTC AACCATGTGA ACTGGGATAC GGATGGTACG AGCTTGCTCC GCAATAGCAC 840

GAGTGATAGC CTGACGAATC CACCAAGTTG CATAACTTGA AAACTTGAAC CCTTTAGAAT 900

AGTCAAACTT GTCAACCGCC TTCATCAAGC CCATATTTCC TTCTTGAATC AAGTCAAGGA 960

ACTGCATACC ACGACCGACA TAGCGTTTGG CAATGGAAAC AACCAAACGA AGATTGGCTT 1020

CCGCAAGACG TTCTTTCGCT TCGATATCAC CAGCTTCAAC AGCCAGTCCC AACTCTTTCT 1080

CCTCTTCATT GGTCAAGAGA GGAACGACCC CTATTTCTTT CAAGTACATA CGGACAGGGT 1140

CATTGACCTT AGCAGAAGTT GACCCAATCA AGTCCTCATC GCTGAGTTCT GGTTCTTCTT 1200

CATTGCTGAG AACACGCGCA CTTGGATTTC CTTCGTTATC TGTGATAGAA ATGCCTGCAT 1260

CCTGAATCCG TTGCAAGAGA TCTTCAATCC CATCAGCGTC CAAGGTAAAA GGAATAACCA 1320

GACTTGCATT GATTTCATCA TCTGTTGCTG TCCCTTTTTG CTTATGATTA CCGATAAATT 1380
CTGCTACCTG TACGTCAAAT GTTGTTACTT CTrTTTGTTT TGTTGCCATT ATTACTCCAT 1440

TCTTCTCTTT TGGGAAATTA AACGTTCCAA TTCTTCTAGG GCTGTATCTG TATCTCCTAC 1500 

ATGGCTAGCT TCCTGCACCT TCTTTTTGAT TCTCATATTG TCCTGATTCA AGAGAGCCTT 1560 

GTTTCGAGTC ATCTCTACTT CACTAAGTTC CTGCGGCGAT ATCTCAGCAG GCAAATCCTG 1620 

AGCTAAAACT TGGTACCAAG CTCTTTCAAC TTCCTCTGTC TGCTCTGCTA AAACTTCTGG 1680 

AGGAAGATTT CCATACTGGC CAAGCAAGTC ATATAAGACC TGAAATTCAG GTGTAGCAAA 1740 

TGCAAAGTCT TCTCGCAAAC GGTAATCGTT CAAAACAAGA GGGGATTCCA TCATCCGATA 1800 

GAGTAGATGG GCTTCTGCCC TCATAATAGC CGATAACTGC TTGGTGACAG GCATGGTGAT 1860 

TGGCGTCGGT CTGGAAATTC CTTCCATGCG ATTCTGCCTT TGCACCTGAC GACTCTCATT 1920 

AACAATCTGC TCAATCTGGG TATAATCAAA GGACGCCAGA CTGTCAGCTA AAATATGAAT 1980 

ATAGCTGTTT TGAGCAGCGA TGGACTTTTC TTGAACAATC AAGGGAGCTA TTTTTTCAAG 2040 

AAACTCAATC TGAGCCTGCA GATTTTCACT GTTTTCAGGT TTGTACTGAT GAATGTAGAA 2100 

CTCAATCGGA CTAATACGAG TTTTCGTTAA TAGATAGGCC AAGTCTTCTG GACCATTTTT 2160 

TTGTAGATAC TCATCAGGAT CCAAGTTATC AGGCATGCTG ACGATTTGCA CAGGCATATC 2220 

ACCAATTTCA TCCAATGCTT TCAATGTCGC GGCTTGCCCA GCCTTATCTC CATCGTAAAC 2280 

AAGAACCAAT TTCTTGGTTA ACCTTTTCAG ATGCTCAACA TGCTCTCGAC TCAAGGCTGT 2340 

TCCCATCGAC GCCACAGCAT TTTCGATTCC AGCCCGATAG GCTGCAATAA CATCCATGAA 2400 

TCCTTCCATC AGGTAAATCT CACTAGCTTT TCCAGAAGAT CTTTTTGCCC TATCCATATG 2460 

ATATAATTCG TAACTTTTGT TAAAAATTGC AGTCGATCGG CTGTTTTTAT ACTTAGAAGT 2520 

TTGTGAATCC GTTTTTTGCC AGATACGACC TGAGAAGGCA ATCACCTTTC CTTGGTCATT 2580 

TGTCAGGGGA AACATAATGC GATTGTGAAA GGTGTCTACA AATTGATTGG CATCCGAGAG 2640 

ATAAAACAGG CCTGAATCCA GTAAATCCTC TTCACGATAC TGATCAGACA AACGTTGATA 2700 

GAGATAGTTT CGTTCTGGAG GTGCTAAACC AATCCAAAAA TGTTTAAGCA CTTCATCTGT 2760 

CAACCCCCGC TGATAAAGGT AATTTCTGGC CTCTTCGCCC ATAGTCGTTG TCATGAGAAT 2820 

AGCATGGTAA AATTTGGCTG CATCTTCGTG CATATCATAA AGAGCTTGGT GAGGTGAGGC 2880 

TGACTTCTGC TCACTATAAA GCGGTTTTTC AACCTCAATT CCAACACGCT GACCTAAGAT 2940 

TTGGACTGCT TCTATAAAGG GAACCCCTTG GTACTCCTCG ATGAACTTAA AGACATCACC 3000 

TGAGCGACCA CAACCGAAAC AGTGATAAAA CTGCTTGTCC TCTACAACAT TGAAAGATGG 3060 

TGTTTTTTCA CCATGAAAAG GACAGAGCCC TAGATAGTTC CGTCCTGCCT TTTGTAAAGA 3120 

AATCACATCT CCTATGACTT CCACAATGTT GGCATTGTTT TTGATTTCTT CAATGACTTG 3180 
TTTGTCAACC ATACACAATA CCTCCATGTT ATCATAGTTT ACTTTATATA GTATACTTTA 3240

TTTCAGAAAA AAAGTAAACC ATTTCACTCA TTTTCCCTAC TTTATTCAAA GAGTTGATAA 3300

TAATCAGAGA TTTTCATTTT TGCTTTTTCT TCTTGGTTTA AATCTTCGAT AATTCGTCCT 3360

TCTTTCATGA CAATCAAGCG ATTGCCCTAT TTGAGAGCAT CTTCCATATG ATGAGTAATC 3420

ATAAGGCCTG TTAGCTGATC TTTCTTAACA AATTCATCTG TCAATTCCAT CAAAGCAACA 3480

CTAGTCTTTG GATCCAGGGC AGCAGTATGC TCATCTAACA GGAGTAATTC AGGTCGCTTC 3540

AAGGTTGCCA TCAAGAGACT CAAAGCCTGT CTTTGTCCAC CTGATAAGAA CTCAATCGGT 3600

GTATTCAAGT GTTTCTCAAG ACCATTTCCT ACTTTTTCAA TGGTTGCCTG AAATTCATCC 3660

TTATAGCTAG TCAAGCGTCG TGGTAACAAT CCACGCTTTT CACCACGAAA CTTGGCGATT 3720

AAAAGATTTT CAGCGACCGT CATACGGGGA GCTGTCCCCA TCTTTGGATC TTGGAAGACA 3780

CGAGACAGGT ACTTGGCACG CTTCTCGGGT GAAAACTTAG TGAGATCTTC ACCTAAAATA 3840

CGGATAGTTC CACTAGTTAG TGATAAGGTC CCTGCTATAG TGTTAAAGAG AGTTGATTTT 3900

CCAGCACCAT TTCCGCCCAA AATCGTGATA AACTCCCGTT CAAAAATTTC TAAGGAAACA 3960

TCATTTAAAA TAATCTTTTC TTCATCAAAG CCATTTTTAA CGATTTTGGT TGCATTTTTT 4020

AATTCTACAA TTGCTGTCAT TTGCTTAACT TGGCTCCTTT CAAGATTGTT TGCTTAAATG 4080

TTGGAATCAT GAGGCAGACT GCTAAAATCA AGGCACTGTA TAAACGAAGG TAACTTGTAT 4140

TAAAGCCAAG TGCGATAACT GCCCACACTA AAAATTGATA AGCGATAGAA CCTACAACGA 4200

TAGTAACCAA ACGCTCTGCC AAGCTCAAAC TCTTGAAAAT AACTTCTCCA ATAATCAAAC 4260

TTGCAAGCCC CACAACGATA ACCCCGATCC CTCGAGACAC ATCGGCATAA CCTTCTTGCT 4320

GAGCAATGAG GGCACCTGCA AGGGCAATCA CACCATTTGA TAAGACCAAG CCCATGAGCT 4380

CCATGCGTCC AGTATGAATC CCGAAACTTC TAGCCATATC AGGATTATCC CCTGTAGCAA 4440

TATAGGCTTG TCCGAGTTTA GTGTCCAAGA AAAAGAGCAT GAGAGCAATA ACAATACTCA 4500

CAAAGATGAG ACCTGTCAAG AGTTGATTCA AATCCGAATC AAAAGGCAAA ACATCCTGAA 4560

TTTGCTTGGT TCCAAGCAGG CCTAAATTCG CACGTCCCAT AATCAAGAGC ATGATTGAGT 4620

GACAAGAAGT CATCACCAAA ATCCCTGAGA GCAAGGTTGG GATCTTCCCT TTTGTATAAA 4680

GAAGGCCTGC TGCCATTCCA GCCAAACAAC CTGCTCCTAC AGCAACAAGT GTCGCTAAAA 4740

ATGGGTTCAC GCCTTTGGTT ATCAAAGTGA CAGCAACAGC TCCCCCAAGA GGGAAGGAAC 4800

CTTCTGTCGT CATATCTGGA AAGTTTAAAA TCCTAAATGT CATAAAGATT CCCAGACCTA 4860

GAATAGCCCA GACAAATCCT TGAGAAATAA TGGAAACAAT CATATTTTAT TTAATCCTTT 4920
CTATATTCAT CTTTTTAAAA AATGGGAAGA GTCTCCTCCT CCCTACCTTA TTTATTCGAT 4980

GACTTCTCCT GCTTCTTTGA GAACAGACTC AGGAATAGTA ATACCTAGTT CTTGTGCTAT 5040 

TTTTTTATTG ATGACTGACT TACCAGTTGA AAAGACATTG ACTGGGGTAT CGGCTGGTTT 5100 

TGCACCTTTC AAGACTTGCA CAATCATTTT ACCTGTTGCC ACACCAAGGT CATGTTGGTC 5160 

AATTACAACT GATGCCAAAC CACCTACTTC TACCATAGCT GTCGCACTGG GATAAATTGG 5220 

TTTCTTAGAA CTTTGATTGC TAGAGACAAC CGTTGGAAAT CCTGATGCAA TGGTGTTATC 5280 

AATTGGAACC CAAATAGCAT CTACCTTGCT AGTCATAACA GTGACAGTTG AGGCAATTTC 5340 

ATTTGTTGAA GGAACTGCAA ATGTTTCCAC TGTCAGACCT GCCTTTTCAG CATAAGCCTT 5400 

AAATTC 5406 
(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9711 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CAGCTTGCTC TTACTATTAT AGCAGATGTT ATAGCTGGAA TTATCTTGTA TTTCGTCTGC 60 

AAATGGCTAG ATGGTAAGAA GTAGACCGAA TGACTAGCCT ATAAACACCC GTTAAATCGC 120 

TAAGATACGT CAAAAAAGCC CTTAACTATG GCACTAGTTA GGGGCTTTGG TGTTCTAATG 180 

AACCTTATAC ACTAACTACA TTCTAGCATA TAAGCCCAGA TATTTCAAGA GTTTTATTTA 240 

TTGTTTAAAG TTCTGAAAGG TCTATAATGA AGTTAGCCAT CTAGTATCAA AAAACCCACT 300 

AGCTCTTATG AACTAGTCGA TTTCTCATCA ATGCGCCAAC ATTTCTTGGG CGATTTCTTG 360 

GCCAGATAGG TTATCTGGGT AGTAGGTTGG CCAGTTGTCC ATTTCTTCAA AGAGGGCTTC 420 

TTGGCTTGTG CCTCCAAAGA AGATATGGAA ATGTTCTGCC TTAACTGGGG CAACATTCiG 480

GTCACTAAAC TGAACATACT TGAATTGTCC AGCGTCAGCA TCTGTGGCTT CAAAGAGGAA 540 

ACGCACGCCA CGATTGCCTT TCTTGTAAGT CAAAATTTTC TTACCGACAT ACTTGTAAGT 600 

GTATTTCTTG CTTTGTCCAC CTTGAACAAA TTCCATAGTA TTATCAGTAA TGTTAATCTT 660 

AGTCACATCT GTATGATAGC CTTTTGTATA GTAAGCCTTG TACTCAGCCT GGGTCATCTT 720 

ACCAGTCAAC TTAGCCTTGT AGTCAAAGAC TTGGTCAAAC GTGCCGTCTT CAAGGAAAGG 780 

ATAAACTGAT TGCCAGTTAC CTGCATAGTC ACTCAAGGTG CGGTCCTTGA CAGCTGCATC 840 

CTCGAAGTAA CCATTTTGGA CTGTCTTGGT ATCCTCTGCC TTTTCAGGTT CAATTGCTGG 900

GCCTTCTTGG TCTGTTGTTT GTTTCAAAGC CTTGAGGTTT TTCTCCATCA CGGAAATGTA 960

GTTTTCTCCA GCCTTGGTGT CCTCTTCTGT CAGACTTTCT AAAGGATTGA GGACATCAGT 1020 

TTTGACACCT GCTTCTTTTG AAAGTGTGTT AGCAAGGGCT TGTGAGGCAT TTCTTCAAAA 1080 

TAGATATAGG CGATTTTATT TTTCTTGACA TACTCTGTCA ATTCTGCCAA GCGACCAGCT 1140 

GATGGCTCTG CATCTGGAGA AAGTCCTGAG ATTGCGACTT GTTTGAGTCC ATAGTCCAAG 1200 

GCAAGATAGT TAAAGGCTGC GTGTTGAGTC ACAAAGCTCT TTTGTTTTGC TTGAGACAAA 1260 

CCTTCTGCGT AAGCCTTATC CAAGGCTTGC AATTTTTCGA TATAGGCAGC TGCATTCTTC 1320 

TCAAAGGTCT CTTTTTTATC AGGATAATCT GCTGACAAGC TGTCGCGGAT GTGCTCTACT 1380 

AGTTTAATGG CACGAACTGG TGATAACCAA ACATGGGGGT CAAACTCATG GTGATGACCT 1440 

TCTTCTCCAT GGTCATGGTC TCCCTCTTCT TCCTCGCCAC CTGGCAAGAG CAACATATCG 1500 

CCTGTCGCCT TGATGGTTTT CACTTTTTTC TTATCCAAGG TATCTAGCAA TTTAGGTACC 1560 

CATGTTTCCA TGTTTTCATT TTCATAAACG AAGGTATCTG CATCTTGGAT TTTGGCAACT 1620 

GCCTTGGCAG ATGGTTCGTA TTCATGAGGT TCTGTCCCAG CACCGATTAG GAGTTCTACA 1680 

TTAGCCGTAT CTCCTGCGAC TTGCTTGGTA AATTCATAGA CAGGGTAAAA GGTTGTCACG 1740 

ATATTGAGTT TACCATCTGC CPGTTTTTGA TTGGAACAAG CCACTAAAAA CAAGGCACAT 1800 

AGACTGGCTA GTAATAAGCT AATTTTTTTC ACGTTCGTCT CCTATTTGAT AAAACGTCTT 1860 

ACTAAACTGA TTAGTATAAA GACAGTTACA AAAATAATGG TAATACTTGC ACTTGCAGGT 1920 

GTTTCTGCAT AGTAGGAAAT GTAAAGTCCT GCTACCATTC CCAAAAAGCC AATCGCACTG 1980 

GCAAGCAGCA TAACCGATTT AAAGTTTTTC CCCAGACGCA GGGCAATACT AGCTGGCAAG 2040 

ACCATAATGG TCCATACCAG AAGAGCTCCT GCTGCAGGAA TCATAAGGGC AATAGCCACC 2100 

CCTGTCACCA TGTTAAAAAG AATGGACATG GTACGAACTG GCAAGCCATC CACAAAGGCC 2160 

GTATCTTCGT CAAAAGTTAA GATATACATA GGACGAAGAA AGAGAAAGGT CAAAATCAAA 2220 

ACAACCGCCG CAATGACAAA GAGGGAAATG ACCTGTTCTT CACTGATAGT CACGATCGAA 2280 

CCAAAGAGAT ATTGGTCCAA ACTCATTGAA CTCGAGCTTT TACCCTTGCT CATGACAATC 2340 

AGAGAAACAG CCAGACCTGT TGACATGAGC ATAGCTGTCC CGATTTCCAT AAAGCTCTTG 2400 

TAAACCGTAC GGAGATACTC CAGAAAGACC GCCGCAATCA AGACAATGGC AATAGTAGAA 2460 

ACAGTTGGAG AAATCCCCAA AACCAGACCA AAGGCTACAC CTGAAACTGA GACGTGGCTA 2520 

AGGGTATCAC TCATCAAACT CTGACGACGC AAGATGAGGA AGCTTCCCAA TACCGGTGAG 2580 

AAAAGACTCA TAGCAATAAC CGCCAAAAAG GCGCGTTGTA TAAAGTCGTA AGATAATAAA 2640

CTAAGCATCG CCCACCTCCT GGCCATTCTC ATGAACATTG AAACAACCCC ATGGCGAGTC 2700

TTGGTTACGG ACTAGATGAA TATTGCGATC CGCATAATCC TTAACTTCTT CAGGGTCATG 2760

GGTAATCATC AAAACAGCCT TGCCATGATG ATGGGCGCTG TGGTGCATGA GTTCGTAAAA 2820

TTCATTTTTA CTTCCTGCAT CCATCCCCGT TGTCGGCTCG TCTAGGATAA ACACATCAGG 2880

GTCACAAGCA AACATACGCG CAATTACCGC TCGCTGCTTT TGTCCCCCAG ATAGAGACCC 2940

CAAGCGTTTG TCTCGATGTT CCCACATGCC AACTGAGTCC AGACTAGCCT TGATATGCTC 3000

CTCATCATGA GCATTCAAAC GACGGAACCA GCCTTTTCTC GGATAGCGAC CCGACTTGAC 3060

AAATTCATAG ACCGTACTTG GAAAACCAGC ATTAAAACTG GCAATTTGTT GAGGAAGATA 3120

GGCTATTCTC AATTTCTTAC CTTCCGTATT TGTCTTTGAA ATAGCCACCT TTCCAATGCG 3180

TGGTTGCAGA ATTCCAAGAC TAGCCTTGAT GAGCGTCGTC TTAGCCGCTC CATTTTCCCC 3240

AGTCAAGGTA ACAAATTCCC CACTATCAAC ACAATAATTG ATATGTTCAA GAACAGGCTC 3300

CTTATCATAA TAGAAGGACA AATCCTCTAC CGTAATATAT CTCATTATTT GATTTCTCCT 3360

ACTAAAGCAG TCAAAAACCG CTGAATCACT TTTTGTTCAT TTGGAGTAAA CTGAGTCGCC 3420

ACTTGTTCAT AGGTTAAAAG TGTATGCTCA TGGTGATGGT GGTGCTCCTC AGCGATTGGA 3480

CGAGCCAAGT CAGTCAACTG ATAAAAAATC ACACGCGCAT CTTTAGAATC TTTAGATGTT 3540

TCCAACATCC CTTCCTTGAC CAAAGACTTA ATGGCCTTGG TAACTGCCGC CTGACTGACA 3600

TTGAGACGAC GGGCCAATTC TGAATTTGTT AAAGATTCCT CTGACAAGAG CATAAGGATA 3660

TGCTCCTGAG TATTGGTCAG GGCCACCTCG CTAGTGCAAT GACCTATTAG GATTTCATGC 3720

TGATTTTCCG CCTGCAAAAT CACCTCATTC AAAAAAGCAT TGATATCCTT TGCTAGCTGT 3780

CTCATATCTG ACTCCTTTCC TTTTAGACTT CTCTTTTTTA AGAGAAAAAT ACTATTCTTT 3840

GACATTTTGT TTACCAGTTA ATTATATCAC AAGCAAAAAA AGAGTCAAGA AAAAACGTGA 3900

AAACTAGTTT CATTCTTGAA CTCTTCTATA TTATATTATC TATTGAAATT CTTTGACATC 3960

TCCATCATAA GTCGCCCAAT CTTTGCTGAA AAAGCGCTCA TTCAGATGGT AAGTCGGAGC 4020

TGGTGTGGGA TTGGATAGGA AAGGATCAAC TGCCTTGTCA AAAGCCAACC AACCCAACCA 4080

ACCAAGGTGA ATGGTGTCCT TCATAAAGAA AGCCTCCCCG CCGTCCTTAG AAAAATCTGC 4140

TATATTGGTA AAACCTTGAC TTTCTAACTG GTAGCGAATC TTCTGCACCG TTTGTTGGTA 4200

CATATCCTCT CGTAGACCAG CATAGTTCAT CCATTTTTTA TTAACAGGTG GAATGATAAA 4260

AATOGGGTTT ACCTTAGATT TAGAAAACTG TGTTAAAACC AACTGCAAGT CATTATACTC 4320

TGGCGACTTG AGATAGGTAA AGCTTTTCTG AGAATCCTTT AATTTCTTCA AATCCTTCTT 4380

GATCTGCTCA TTATAGAAAT AATTTTCCAT TCCCATCTCA TTATTGGAAG TATTTTTTTC 4440

AGCATCTCCT TTGACAACAT CTTCTATTGC CTGATAAGAA AACTGGTCTG GCAAGATTTT 4500

TAAATACTTA GCTACATGCT TATCGTAGTT AACATAGCCT CTAACCGAAA ACTGACCAAA 4560

AAAGGAAGCT TGGCGTTCAT TAAAACGAGC CAATAATTCA ATCATTTCAT TGTCTGCTGT 4620

CGACAATTCT TCTTTACTTG CCAACTTCTG AACCAGGTCC TTCATAGCTA CCTTTCCCAA 4680

CTGTTGCAGT AAGCGAGTCG CTGCATATTG ACTAGCCTGA TCCCCAGATT GATGTTTCAG 4740

AAAACTAGTC AACTGGTCTC CATTAAAATA CTGCTGGAAG GCTGCTGGAT CATAGCCATT 4800

TTTACTGAAC CACTGAGGTG AGATAACATA CACAACTTGT TTATTCTCCA GCTGTGGTAA 4860

CATCTGTTGC ATTCCAAAAT ATTGGTTAAG CGATGCAGCT CCCCCCTGTC CTAAAAGATA 4920

AGGACGGTAG GAACGATTGT ATTTCTCAGC TAATACCGCA GGATGAGCAC CGTCAAAACG 4980 
AAGCCATTCA CTAGAGCCAA AGAAGGGAAC AAAACGCACA TTTGGATCAG ATAGTGCTCT 5040 
GACTTTTTGA CTTCGCTCCT TAAAACTATC GATAGTAGTA GCCACTGCTG AACGCTTTTC 5100 
AGCTCCTAGA TTATGATGCA TCTCAGTAGG ATAAAAGAAA ATGAGCAGAA AAACCAACAA 5160 
ACCAGCGATC AAGACCGGTC CGAAGATCAT CCATAAGCGT TTAAGCATTT TGTAGCTCCA 5220 
CAATACCAGC TATGATTTTA TTAGCTGTAT TCCAGTCCTC ACGACCAAAC TCTGTTACAC 5280 
GGACACGAAT GTCAAAACGG TTCTCAATCT CCACAATCAA CTCAACCGTT CCCATACTAT 5340 

CCAAGACACC TGCATCAAAA AGATCTTCAT CCATCATGTC AGAAACATCT TCCATAAACA 5400 

ACTCATCAAT AATTTCAATA ACTTCTGATT TGATATCCAT ATTTTATTTC CTTTTATTTT 5460 

TTAAACCATA GATTATTCAA GAATCCAGAA AAGATTAAGA ATCACAACAT GACAACATGG 5520 

AAAGTGACAA CCATGCCAAG CAACTGAATC CAGCGATTCT CAGGTAGGGC AGCCTTCCCT 5580 

GCTTTTTTCC GTTCCTTATT GAGCGTTTTT TTCTTGCGAA CCCAGGCATC ATTGATGACC 5640 

AAGCCTAGTC CATGAAAGAG TCCATAGGCG ATATAGTACC AGGTCACACC ATGCCAAAAT 5700 

CCCATAATCA GCATATTTAC AATGTAGGCC ATGCTTGAGG TTACATTACG ATTTTTAAAG 5760 

ACTTTCTTTC TGGTTAACAC CATCACCATT CGCATAAAGA CAAAGTCACG GAACCAGAAG 5820 

GACAGACTCA TATGCCAGCG ATTCCAAAAC TCCTTTAAAT CCCTTGATAA AAAGGGCTTG 5880 

TTAAAGTTGA TAGGGCTACG GATTCCCATC AAGTTTGAGA TGGCCAAAGC AAACATAGAA 5940 

TAACCTGCAA AGTCAAAGAA GAGTTCCAGA CCAAAAGTAT ACATAACTGC CAAGGCATAG 6000 

AGATTAAAGA AGCCACCTGA CTGCAAGGCT AAATTCTTCA GAGGAGGTAG TAAGGTCTCT 6060 

CCTAAAACAT GAGCTAGGAT AAACTTATAC AAAAAGCCCC ACATGATATA GCGGACAGAT 6120 

TCATCCAGCA TATCCATCAA CTCATCTCGC TCAGGAATAG CCTGATAATT TTCATTAAAT 6180

CGCTTAAAGC GATCGATTGG ACCACTCGAG AAAGTTGGCA TGAAGAGAAG GAAACGGAGG 6240

AATTCCCAGA GGGTAAAATC CTTAATCACT CCATCTCTCA GCTCGATGAC AATTCCAACC 6300 

GAACGAAAGG TCAGGTAAGA AATTCCCAAG AACCCAAGCA AAGACTGCGT TCCATTGATA 6360 

GCTGGTTGCA CCTTGACAAA GATAATCGGA AGTAGGGACA GAAAACTAAC TAAGTAGAAG 6420 

ACCCACTTGC CATCCTTGCT TTTTCGATAA TGCTTGTAGA AAAGCAGGAG CAATATTTCC 6480 

CAGCAAAGGT AAATACCCAA GGCAGCTAGT TGATTGGTCT TTCCACCCAC CAACATGGTG 6540 

ACAATAAAGA AGAGACTTAC CAACACTTCA TACCAGGCAA AGCGTTTCTT GAAAAAGAGA 6600 

CCTATAAAGA TGGGCAAGGT TGCAGCAATC ACATAAACAA AATACTGAGG ATTGCCATAT 6660 

GGCTCTAAAT GAGGAAGCTG TTGAAAAAAC TCCATCATCT CTTATTCACC TCGTTAATCA 6720 

ATCCTTTGAT GTCAATCTTT CCATTTGGAG TTAGTGGCAA ACTGTCTCGG TAAAGGAATT 6780 

TAGATGGCAT CATATAGGAC ATCATGATGT CTGTCAGGTC TTCCTTCATG GCCTTGGTAA 6840 

TATCGATATC TCGCTCAAAC TGCTCACGAA CACCGTCTTT TAAGATGACA TAAGCCAATA 6900 

GATTTTGTAC CTTGTGGTCC TTGTTATAGC GCGGTACTGC GACAGCAGAT TCGATAAAGC 6960 

GAGACTTGTT GAGGTTTTGA GAGACATCTT CTAACTCAAT GCGGTAACCG TTAAACTTAA 7020 

TCTGGAAGTC CATGCGTCCG CCGTAGAGAA GCAAGCCCTC ATCTGTCATG GTTCCCACAT 7080 

CGCCTGTGTG ATAGGCTGGC AGATCTTCAA ACTCAAAGAA GGCTTCTGCT GTTTTTTCAG 7140 

GATTGTTCAT ATAACCTTTT GAAACAGCTG GCCCAGAAAC AATGATTTCT CCCTGCTCAC 7200 

CATTTGGCAG TTTATTTCCT TCCTCGTCAA TGATAAAGGT TGGAGAATCA GCCTTGGTAT 7260 

AGCCGATTGG TAGGCGTTTG AGAGTCGCTA ACATCTCGTC TGTCACGGCA ACTGCTGACA 7320 

GAGCTACTGT CGCTTCTGTT GGGCCGTAAG CATTGATGAT ACGGGCATTT GGGAAACGCT 7380 

CGCGCAGTTT TTGAGCTGTT TTGACCGTCA ATTCTTCACC ATCAAAGTAG AAATGCGTGA 7440 

TTCCAGGCAT TTTCTCACTG TTGAAGTATT CAGACAACAT GGCCATATCT GCAAAGGATG 7500 

GTGTTGATGT CCAGATAGCG ATTGGCAATG AAAAGATAGC CGCAAAGAGT TGCTTAAAUT 7560 

CCTGAGTGAT GACTGAAGGA AGAGTGAAAA GCGTACCACC AAGTGCCAAG GTCGGTGCCC 7620 

AATACATGAC AGACAAGTCA AAAGAATAAG GTGGCTGTGC CAGCATTTGC GGACGACTCG 7680 

GTGTCGCAAA TTCCTTATCC GTAATCATCC AGTTTGTAAA GCTGAGGAGA TTATCATGTG 7740 

AAATCTGCAC TCCCTTAGGC TTACCAGTCG TACCAGAAGT AAAGATAATG TAGTAATTAT 7800 

CATCTCCCTT GACTGGATGC GTGATTTCAT AGTTATTCCC TTGGGCAAAG GCTTCTTGAA 7860 

CCTGAGCTAG ATTTATCATT GGTGTAGAAA CCTGCTCCAA GGGAAAGGCT GAAATGGCAA 7920 

TAATCAAGCT TGGCTCTGCT ACTTCTAAAA TAGCTGAAAC TCGCTCCAAG GCCGAATGGC 7980 




TATCAATTGG AATGTAGGCA TGACCTGACT TAGTCAGCGC TACAAAGGTT GCCAACATTT 8040 

CATATTCTTG GCCACCAAAA ACAACCACAG GAGACTTCTC AGGCAAGCCT AGTTGGTCAA 8100 

TGACTGCAGC CAAACTATCC GAATCAGCCT TTAAATCGCC ATAAGTGTGT TCCTGCCCCA 8160

AAACATTATA GACAGGATAG CTAGGCTGTG TCTCAGCAAA ATGCTCAATG GTTTCAATCA 8220 

TATCTGCTAT TGGTTTATTT GACACAATAG GGATTCTCCT TCAAGTTAAA ATTCATTATA 8280 

GATAAAGCTT CCTTGACCCT GACCAAGATA GCTAAAGAAG TAAAGCAGCC CTAGAAAGAT 8340 

AAGAAAATAC AAGGCTGTCC GACCAAGAAA GAGGTACAAT TCTTTTCTCT GTTTCATCAA 8400 

GAAAAACCAT TCATTTCTGT AATTTTTCGC TAAAATAAGA GTGATTCTTA CTAGCTTATT 8460

TTTCTACCAT TGTACCACTT TATATAGTAT CTTTTCAATT GTTTACCGTA TGTTTCCAAT 8520 

AGATTTCAGC TTATTTTAAG GATTATACAG TTTTTCTATG TATATTTTCA AATAGAGTGA 8580 

TCCTCCTTCA AAACTCCATT TCAGGAGACA ATGAAGTAAA TCTTCCCATA ATAAAACACA 8640 

CAATATCAAG TTTTTTCAAC ACCTGATACT ATGCGCTTTT CTGATTTTTA AAGACTTTTT 8700 

AACCACTCTC TCATTTAAAA TAATCTCGTC TGATATAAAT TAAAATAGCT TCTATCATCA 8760

GACAAATGGC TGATAGCCAA AAACTGATGC TAATACCAAA ACTCTCAGTA ATATAGCTCA 8820 

TTAGCAAAAC AAATACTGAA AATGCTAATG TAGAAATCAC TTCAAGAACG GAATAGACAT 8880

TAACTAAATG ATTTTCCTCT ACTGTTTCCT GAAGAAATAC ACTTTCAGGA ACTTCTTTTA 8940 

GTTGCGATAA CATACCAACT AAAGCTGAAA ATAATAAAAA CATCTGTGCG TTTGGAAAAT 9000 

ATAGAATAGT CAGTGTCACT ATTTCCATAG CTACAAGAGG AAAAAGAATA CTTTCCCCCC 9060 

AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC TCATGAGCCA 9120 

CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG TTTTTCCTCG 9180 

CTAGATTTCC TCAAAACGGC AGACTCCTCC CTTGGTGCGT CACACGATTT TTTCATCTCG 9240 

ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA AGGAACAGGA 9300 

AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAACTGCTG AAAACAATTC GGAATAGGCA 9360 

TAGAGACTAG AGAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT TCCCACCACG 9420 

TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC TCCAGCTAGA 9480 

TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG AACTTCGTTT 9540 

TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT CTCCTTGATG 9600

AAATTCTCGG CTTGACCACG TCCACGATAA AGCTGAAACT GGTCTTGGCT GTTCCACTCG 9660

TCATATTTGT AACGAGAGAA ATAACATCGT AGAACAAGTA TCCTTCTTTT C 9711
(2) INFORMATION FOR SEQ ID NO: 168:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

CCCCTTTGTC AAAACTGTAA AATTAACGAC TCAACAATTC ATCTTTACAC CAATCTCAAT 60

GGAAAACAAA AACAAATTGA CCTCTGTCAA AACTGCTATA AGATTATCAA AACAGATCCT 120

AACAATAGCC TCTTCAAAGG TATGACGGAT CTGAACAATC GTGACTTCGA TCCCTTTGGT 180

GATTTCTTCA ATGATCTAAA CAATTTCAGA CCTTCTAGCA ATACTCCTCC TATTCCCCCA 240

ACCCAATCAG GTGGAGGTTA CGGTGGAAAC GGCGGTTATG GTTCCCAAAA TCGTGGATCT 300

GCTCAAACTC CGCCACCTAG CCAAGAAAAA GGCCTGCTGG AAGAATTTGG TATTAATGTA 360

ACTGAAATTG CCCGTCGTGG AGACATTGAC CCCGTTATTG GGCGCGACGA TGAGATTATC 420

CGTGTCATCG AGATTCTCAA TCGTAGAACC AAGAATAATC CTGTCCTTAT CGGTGAACCT 480 

GGTGTCGGAA AAACGGCCGT TGTCGAAGGT CTAGCTCAGA AAATTGTCGA TGGCGATGTG 540 

CCACATAAAC TCCAAGGTAA AGAAGTCATC CGTCTGGATG TGGTTAGCTT AGTTCAAGGA 600 

ACGGGGATTC GAGGACAATT TGAAGAACGC ATGCAAAAAC TCATGGAAGA AATTCGCAAA 660 

CGTGAAGACA TCATCCTCTT TATCGATGAA ATCCATGAAA TTGTTGGTGC TGGTTCTGCG 720 

AGTGATGGTA ATATGGACGC AGGAAATATC CTCAAGCCAG CCCTTGCTCG TGGAGAACTG 780 

CAACTAGTCG GTGCTACTAC CCTCAATGAA TACCGTATCA TTGAAAAGGA TGCTGCCCTC 840 

GAGCGTCGTA TGCAGCCTGT TAAAGTCGAT GAACCAACGG TGGACGAAAC AATCACTATT 900 

CTCAAAGGGA TTCAAAAGAA ATACGAAGAT TACCACCACG TTCAATATAC AGATGCTGCG 960 

ATTGAAGCAG CTGCAACTCT TTCCAATCGC TACATCCAAG ATCGCTTCTT GCCTGACAAG 1020 

GCCATTGACC TCCTAGATGA AGCTGGTTCT AAGATGAACT TGACCTTGAA TTTTGTGGAT 1080 

CCTAAAGTAA TTGATCAGCG CTTGATTGAG GCTGAAAATC TCAAGTCTCA AGCTACACGA 1140 

GAAGAAGATT TTGAGAAGGC GGCCTACTTC CGCGACCAGA TTGCCAAGTA TAAGGAAATG 1200 

CAAAAGAAAA AGATCACAGA CCAGGATACT CCTAGCATCA GCGAGAAAAC TATTGAGCAC 1260 

ATTATCGAGC AGAAAACCAA TATCCCTGTT GGTGATTTGA AAGAGAAAGA ACAATCTCAA 1320 

CTCATCCATC TAGCCGAAGA TCTCAAGTCT CATGTTATTG GTCAAGATGA TGCAGTCGAT 1380 

AAGATTGCCA AGGCTATTCG CCGTAATCGT GTCGGACTTG GTACCCCTAA CCGCCCAATC 1440

GGAAGCTTCC TCTTCGTTGG GCCAACTGGT GTCGGTAAGA CAGAACTTTC CAAACAACTG 1500

GCTATCGAAC TTTTTGGTTC TGCTGATAGT ATCATTCGCT TTGATATGAG TGAATACATG 1560 

GAAAAACATA GTGTGGCTAA GTTGGTCGGC GCTCCTCCAG CTTATGTTGG CTATGATGAG 1620 

GCTGGTCAAT TAACTGAAAA AGTTCGCCAC AATCCATATT CTCTCATCCT TCTCGATGAA 1680 

GTGGAAAAAG CTCACCCAGA TGTTATGCAC ATGTTTCTTC AAGTCTTGGA CGATGGTCGT 1740 

TTGACAGACG GGCAAGGACG CACCGTTAGC TTCAAGGATG CCATCATTAT CATGACCTCA 1800 

AATGCAGGTA CAGGAAAGAC CGAAGCTAGC GTTGGATTTG GTGCTGCTAG AGAAGGACGT 1860

ACCAATTCTG TCCTCGGTGA ACTCGGTAAC TTCTTTAGCC CAGAGTTTAT GAACCGTTTT 1920 

GATGGCATTA TCGAATTTAA GGCTCTCAGC AAGGATAACC TCCTTCAGAT TGTCGAGCTC 1980 

ATGCTAGCAG ATGTTAACAA GCGCCTCTCT AGCAACAACA TTCGTTTGGA TGTAACTGAT 2040 

AAGGTCAAGG AAAAGTTGGT TGACCTAGGT TATGATCCAA AAATGGGAGC ACGCCCACTT 2100

CGTCGGACTA TTCAAGACTA TATTGAGGAC ACAATCACTG ACTACTACCT TGAAAATCCA 2160 

AGCGAAAAAG ATCTCAAAGC AGTTATGACT AGCAAGGGAA ACATTCAGAT TAAATCTGCC 2220 

AAAAAAGCTG AAGTTAAAAG TTCTGAAAAA GAAAAATAAA TCCTATAAAA AAGGAGTAGA 2280 

AAATGAAATT TTTCTGCTTC TTTTTTTACT AAAATAACTG TAATTTCTTG ACAGCTTGCC 2340 

CTTTGTCCAT TATGATATAT AGTAGACTGA ATCTGAAATA GTACGAAACA ATTGCTAAAA 2400 

CATTTATAGA AATTAATTTT ACTTTCCCAA TCGATTTGTT CTCATCTTAT TTCAATCTGC 2460 

TATAGTCAAT TGAAACAAGA ACAAGACAAA AGAGCCTCAT AAAAGGTATT GCAACTTGGT 2520 

AATACCTTTT TGAGGTGCTT TTTGATATGA GCCCATGTTT TCTCAATAGG ATTGTACTCA 2580 

GGTGAGTAGG GAGGAAGAGG TAAAAGTTTA TACCCAAACT CTTCACACAA GAGTTCTAAC 2640 

TTACCCATTC TATGGAATCT TGCATTATCC ATAATAATAA CCGATGGTGT GGTTAATGTP 2700 

GGTAAGAGAA ACTTCTGAAA CCAAGCTTCA AAAAAGTCGC TCGTCATCCT CTCTTCGTAA 2760 

GTCATTGGAG CGATTAACTC ACCATTCATT TGTTAGACCT GCAACCAAAG AAATTCTCTG 2820 

ATATCTTCTT CCAGATACTT TGCCTCTTCT TAACTGACCT TTTAATGAGC CACCATATTC 2880 

TCGATAAAAA TAAGTATCGA ATCCTGTTTC GTCAATCTAA ACAGGTGCTA GGTGCTTTAA 2940 

ACTATTAAAA TTCTTAAGAA ATAAGGCTAC TTTTTCTGGG TCTTGTTCAT AGTAGGTGTA 3000 

GTTCTTTTTT TTTTCGAGTG TAGCC 3025 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4104 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

TTTAAGGTTT TAAAAAAAGT TTTCGAAAGG TTTCTTCTTT ATTTTTTAAG GGAGAGATAA 60

CGTTGATATC TAAATCGTGG TCAAAGCCGG CAATTTTTCC TTTAGATGTG TATTGGTGAA 120

TATCATAATC TAAATCAGTT TTAGGACTGC TCTCCAAAAA TCCTGAGTCT GAGCCGTAGA 180

CGGAATCCAA ACAGAGGTAA ACTTGCCTGT ATCAATACTG TGTTCTTCCA TGAAGTAGAC 240

ACCAACGTAG ATGCCGATGT TTTTAGCACC CAGTGATGCT AGTTTTGCTC GAAAGTTTTC 300

GACACCTTCG TTCATATTAG ACATGGTTTT GTCTTCCACG TCAAGCCAAT AGTAACTAGG 360

GCTGTAAGGA GAGGCAGCAT TGTAGAAAAC TTCGGCAGCC TTTTCCATTT CTTGGACACT 420

TTTTCCAGCT ACATAAGCGT AGACAGCAAC TGGGACATTC CGCTTTTGAA GTTCAGTGAT 480

ATGACTCTTA TAGGCCTTGT CTATTCCATT GATAAATGAA GCATCATTTT CTTTTGTCGT 540

TTGAGCACCA CTGTGAACAC GAACAATAGC ACCTGAAATA TTTTGTGAGA GGGCATCGTA 600

GTTGATTTCC TCAGGACGCT GCCAGCCAGA GAGGTCAATA ATCGGTTTGT CTAAGTGTTT 660

CAAAGCCTGT GCTTCAATCT GTGCTATATT GGATTTTGTT TTAAACGATT GGCTGTCATT 720

AAGTGGGCGA TTCATGATTA AAATGAACAT CATAATCCCA AAAAAACTAA ATAAAATAAG 780

TGGATGAATT TGTTTTCTCA TATCTTATAA TTCTACCCTA AAAATCAAAA AAAATCAAAA 840

AAATGGGTTA AGGAAGAGAC TTTAGAGCAT TTTTTCATTC AAGAGTGCGG AATGATTTGA 900

AATATGGTAT AATAAAAGGG AATTTCTACA AA l"i~l"l\»CCA 960

TTATTTTAGC AGCGGGTAAA GGGACTCGCA TGAAATCTGA TTTGCCAAAA GTTTTGCACA 1020

AGGTTGCGGG TATTTCTATG TTGGAACATG TTTTCCGTAG TGTGGGAGCT ATCCAACCTG 1080

AAAAGACAGT AACAGTTGTA GGACACAAGG CAGAATTGGT TGAGGAGGTC TTGGCTGGAC 1140

AGACAGAATT TGTGACTCAA TCTGAACAGT TGGGAACTGG TCATGCAGTT ATGATGACAG 1200

AGCCTATCTT AGAAGGTTTG TCAGGACACA CCTTGGTCAT TGCAGGAGAT ACTCCTTTAA 1260

TCACTGGTGA AAGCTTGAAA AACTTGATTG ATTTCCATAT CAATCATAAA AATGTGGCCA 1320

CTATCTTGAC TGCTGAAACG GATAATCCTT TTGGTTATGG ACGAATTGTT CGTAATGACA 1380

ATGCTGAGGT TCTTCGTATT GTTGAGCAGA AGGATGCTAC AGATTTTGAA AAGCAAATCA 1440

AGGAAATCAA CACTGGAACA TACGTCTTTG ACAACGAGCG TTTGTTTGAG GCTTTGAAAA 1500

ATATCAATAC CAATAACGCT CAAGGCGAAT ACTATATTAC AGACGTCATT GGTATTTTCC 1560

GTGAAACTGG TGAAAAAGTT GGCGCTTATA CTTTGAAAGA TTTTGATGAA AGTCTTGGGG 1620

TAAATGACCG TGTGGCGCTT GCGACAGCTG AGTCAGTTAT GCGTCGTCGC ATCAATCATA 1680 

AACACATGGT CAACGGTGTT AGCTTTGTCA ATCCAGAAGC AACTTATATC GATATTGATG 1740 

TTGAGATTGC TTCGGAAGTT CAAATCGAAG CCAATGTTAC CTTCAAAGGG CAAACGAAAA 1800 

TTGGTGCTGA GACTGTTTTG ACAAACGGTA CTTATGTAGT GGACAGCACT ATCGGAGCAG 1860 

GAGCGGTCAT TACCAATTCT ATGATTGAGG AAAGTAGTGT TGCAGACGGT GTGATAGTCG 1920 

GTCCTTATGC TCACATTCGT CCAAATTCAA GTCTGGGTGC CCAAGTTCAT ATTGGTAACT 1980 

TTGTTGAGGT GAAAGGATCT TCAATCGGTG AGAATACCAA GGCTGGTCAT TTGACTTATA 2040 

TCGGAAACTG TGAAGTGGGA AGCAACGTTA ATTTCGGTGC TGGAACTATT ACAGTCAACT 2100 

ATGACGGCAA AAACAAATAC AAGACAGTCA TTGGAAACAA TGTCTTTGTT GGTTCAAATT 2160 

CAACCATTAT TGCACCAGTA GAACTTGGTG ACAATTCCCT CGTTGGTGCT GGTTCAACTA 2220 

TTACTAAAGA CGTGCCAGCA GATGCTATTG CTATTGGTCG CGGTCGTCAG ATCAATAAAG 2280 

ACGAATATGC AACACGTCTT CCTCATCATC CTAAGAACCA GTAGGAGCCT ATCATGGAGT 2340 

TTGAAGAAAA AACCCTTAGC CGAAAAGAAA TCTATCAAGG ACCAATATTT AAACTGGTCC 2400 

AAGATCAGGT TGAATTACCA GAAGGCAAGG GAACTGCCCA ACGGGATTTG ATTTTCCACA 2460 

ATGGGGCTGT CTGTGTTTTA GCAGTAACGG ATGAACAAAA ACTTATCTTG GTCAAGCAGT 2520 

ACCGCAAAGC TATCGAGCCT GTCTCTTACG AAATTCCAGC CGGAAAATTG GAAGTAGGAG 2580 

AAAACACAGC CCCTGTGCCA GCTGCCCTTC GTGAATTAGA GGAAGAAACA GCCTATACAG 2640 

GGAAATTAGA ACTCTTGTAC GATTTTTATT CAGCTATTGG CTTTTGTAAT GAGAAGTTAA 2700 

AACTATATTT AGCAAGCGAT TTGACAAAAG TGGAAAATCC GCGTCCGCAG GATGAGGATG 2760 

AAACCTTGGA AGTCCTTGAA GTGAGCTTAG AAGAAGCGAA AGAATTAATC CAATCAGGTC 2820 

ATATCTGTGA TGCCAAGACA ATTATGGCTG TTCAGTATTG GGAGTTGCAG AAAAAATAGA 2880 

GGAGGTCAGT ATGGGTAAAT CTTTATTAAC GGATGAAATG ATTGAAAGAG CTAATAGAGG 2940 

CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA TTTTACCAAC 3000 

CTCTTCTTCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC AGGAAACCTT 3060 

GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA CCAAGAGAAA 3120 

TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT TGATTTTGCT 3180 

TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA GGAATTATTG 3240 

CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC CAGGAGCAAG 3300

TTGTTTTTGG GAATACCTAT CATACAGGAA CCATTGCTTC TCATGAAGTC GTTCTTGTAG 3360

AAAGTGCAAT TGGTAAGCTC ATGTCTGCTA TGAGTGTGGC GATTTTGGCT GATCATTTCC 3420 

AGGTGGATGC CCTTATTAAT ACGCGTTCAG CTGGGGCAGT AGCAGAAGGT ATCGCTGTTG 3480 

GGGATGTCGT GATTGCTGAC AAATTAGCCT ATCATGACGT GGATGTCACA GCTTTTGGCT 3540 

ATGCTTATGG ACAAATGGCG CAACAACCGC TTTATTTCGA ATCAGACAAA ACCTTTGTTG 3600 

CTCAAATCCA AAAGAGTTTA TCTCAATTGG ACCAAAACTG GCATCTTGGT TTGATTGCTA 3660 

CAGGAGATAG TTTTGTTGCA GGAAATGACA AGATAGAAGC GATTAAGTCC CATTTCCCAG 3720 

AAGTTTTAGC CGTGGAGATG GAGGGGGCAG CTATTGCTCA AGCAGCGCAT GCCCTCAATC 3780 

TCCCAGTCTT AGTCATCCGA GCTATGAGTG ACAATGCCAA CCATGAAGCA AACATCTTTT 3840 

TTGATGAGTT TATTATCGAA GCTGGACGTC GCTCTGCCCA AGTCTTGTTG ACCTTTTTGA 3900 

AGGCTTTAGA TTAAGCGGAA ATTTGACAGT TTTTCTAGCT TATGATAAGA TTTAAGTAAA 3960 

GAAAAGCTAG AAAACGTTTC AGAGGATATT ATGAGTATTG AAATGACCGT CAGTGAGATT 4020 

GCAGAGGTCT TAGGATTATC TCGCCAAGCA ATCAATAACC GTGTCAAAGA ATTACCAGAA 4080 

GAAGACACAG ATAAAAATGA CAAG 4104 
(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

CACGGATAGG CTCGGCTTTC ATCAGTCCTC AGGCTGATTT ACTAATAGCA ACTTTCCTCG 60 

ACAAAGTCCA CAGCGATACG TnTGGGTATC AATCCTACGC TTACCCTGAT ACCTTTGCTG 120 

GCAGGATTGG CAACGATAGA GCTTGATTGG CTTGGAGTTA CTATTGGGCA AGGATGGTAC 180 

AAACCGTAAT CCATCCACTG CTTTCAACAG TTCCTTAAAA TCCCGATCCT TGTGTTGATA 240 

GCCTTTCCCT TGAAAATAGA GGTGATAATG ACAGAGTTCA TGTCGGACAA TTTTCCTAAA 300 

AACGTCCAAC CCCAGTTCCT GATAAACCTT GGGATTAAAA TCCAAATGCC CATCTTTGGG 360 

GAAAAATCGC CCACCTGTCG AACGTAGACG CCTATTCCAC TGGACATGAT GGATAAAAGG 420 

TCTGCCGAAG TCTTCTAGTG AAACCTGCTT GACGTAATCA GTCAGTTTCA TTTGGAGCTA 480 

GGAGAGACAG ATTAACTTTT TCACGTTCAG TATCAATTTT CTTAACCCAA ACGCTCACCA 540 

AATCTCCAAC TGCCACCACT TGACTAGGGT GTTTGATAAA CTTGCGACTC ATATGGGAAA 600

l n I \>Vx/\ 1 vnu nU LwTL CT v. A TGAATTCCOA TATCAACAAA AGCACCGAAA TCAACAACGT 660

i V.U l~l\."TAGC *l"I"l~l\»"i"CCAA CCACTAAGTC CTTGATATCT AGGACATCTT 720

i»GTGCvSTCAA AGGAATCACG GAAATCTCGA CCTGGTTTGA GAAGATCTGC 780

AA 1\»AT ATCT TTAAGAGTTT CTGGACCAAG GTCTAACTCT TGCGCCATTT CCTTGACTGA 840

AGTTTGCTTT gggcttci re GTTTAGGTCT TTAATATCTA AACGTTPGAA 900

UAw 1 ILV VTA ACTGCAGTGT AATTCTCTGG GTGAACTCCT GTATTATCAA GGATATTGCT 960

ATACGAAGGA AACCAGCAGC CTGCTCAAAG GCCTTGGCTC CCAGACGAGG 1020

AACTTTCTTG ATTTGGGCGC GTGAAGTGAT TTTTCCTTCT TCCTCGCGGT ATTTGACAAT 1080

ATTTTCAGAG ATAGTTTTGT TGAGTCCAGC TACGTGTGAA AGAAGAGCTG GGCTAGCTGT 1140

ATTGACATTG ACACCAACTT GGTTAACCAC TGTATCGACA ACAAAGTCCA GACTCTCAGA 1200

TAGTTTCTTC TGACTGACAT CGTGTTGGTA TTGACCGACA CCAATTGACT TAGGATCGAT 1260

TTTGACCAAT TCCGCAAGAG GATCTTGCAA ACGACGGGCG ATAGAAATGG CAGAGCGTTT 1320

TTCAACGGTC AAGTCTGGAA ACTCCTGACG AGCAAGTTCG CTGGCAGAAT AGACAGAAGC 1380

ACCACTTTCA TTAACGATAA CATAGCTGAC TTCAGGGAAA TCTTTCAGAA CTTCCGCTAC 1440

AAAAGCTTCA CTTTCACGAC TGGCCGTTCC ATTTCCAATG GCAATAATCT CTACACCCTA 1500

TTGACCAATT AAATCTGCTA AATCTTTCTT GGCTTCTTCG ATTTGACGAG CTGATGCTGG 1560

TTTAACAGGA TAAATAACCT GAGTTGTCAG CATTTTTCCT GTTGCATCCA CGACAGCTAG 1620

CTTGGCACCT GTACGAAAGG CTGGGTCAAA TCCAAGAACC ACGCGCCCTT TCAGTGGAGC 1680

AACCAAGAGG AGATTGCGCA GATTGTCAGA AAAAAGTTGG ATAGCTCCTT CTTCAGCTTT 1740

CTCAGTTAAT TCTGTCCGAA TACGACGCTC GATAGCAGGC AAGACCTTTT TCTTAACGGA 1800

TTGCTGAACA ACTTCATCAA TATAAGCATT TTTCACCTTG AAACGAGTAG CAAAGAAGGC 1860

AAGAATACGG TCCGTCGCAT GTTCAAAACC GATCTTCAAG ACACCAAGTT TCTCCCCACG 1920

ATTGAGAGCC AAGGTACGAT AGCCTTGCAT AGTTCCAACT GTCTCTGAAA AATCATAATA 1980

AATCTGAAAA ACCTGCTTTT CATCAAGACT TTCATCCTTG GCTTGAGAAG TAAGTTTACA 2040

GTGTCTCAGC ACTTCCTGAT AAGTCATAGA ACGCAAGGTC ACATCTTCCG ATAAGGCTTC 2100

GACCAAAATA TCAACTGCAC CGGTCAAGGC TTCCTTGCCA GTCGCAAATC CTTCACAGAC 2160

GAACTTTTCA GCTTCTTTCT CTAAGTCAAC TATATTCTGC AAAATCAAGC GAGCAAGAGG 2220

AAAGAGTCCA GCTTCACGGG CAATGGTTGC CTTGGTACGA CGCTTTTCCT TATAAGGAAG 2280

ATAGAGTTCT TCAACGTCTG CTAATTTTTC GGCAACTAAG ATAGCTTCTT CCAATTCCTT 2340

GGTCAACTTA CCTTGTTCTT GAATCTTAGC TAAGACAGCT TCCTTACGGT CATTGAGATT 2400

TGTCAGACTT TTATCCAAAT CAATAATAGC CTTAATCGCC ACCTCATCCA GACTACCAGT 2460 

CATGTCCTTG CGATAACGCG CGATAAAGGG AATAGTCGCC CCTTCAGCTG TCAAACTTAG 2520 

AACGGTATCA ATTTGCTTTA ACGTCACTCC CAAATCCTGA GAGATTTTTT CATATTTTTT 2580 

ATCCATAAAT CTATTATACC ACAAGCTAAA CGTTTCAAAT TAACTCGTAG AACATTTAAA 2640 

AAATATGTAG GAAATAGATT TATATGCTAC AGCGCAATAA CTTGCACTTA AAGAGCATTG 2700 

CCACCTTTTT TTAACCAAGC CATGATATCA AAAGTATTTA ATGGATCAGA CATAATAGCC 2760 

AGTTCTGGAA GATGTTCCTG ACCTGGAATA ACACATTGAC TTTTCAAATT TTTATATGGA 2820 

CGATTGACTA AAATTAATTT ATTAGAATAA GGAAGATTAT CCATCTTATT TAAAATTTCT 2880 

TCACTAGCTG AATCTTTATT ATCAAATTTA AAATAAAGAT TATTCCAATT TATGCGTTTT 2940 

TTTCTTTTTT CCCACTTAGT TCGTGCTTCT TCAATACTAG AATAATGTAG AAAATGAATA 3000 

TCTATATCTC CTAAGTGCCC CAAAGGATAA ACTTCATGAG TCCAGCTCGG TGAAATAAGT 3060 

TCCTCTTCGA AAACAAGTTC TTGTTCCATA TAATAACGAA AATGCTTTGT AAGTTTATAA 3120 

TAATCATCAG GAAGAATAAA TAAACCAACA AAAGGTGTTC TATATTGAAA ACCAAGCTGT 3180 

TTATAAATTA ATCCTCCAAC ACAATTATTA CTTATAATCG TAAAATCTAA TCTATCAAGC 3240 

TCAAGAAAAG GGAAAATTCC TTTCTCTGCA GCTATTAACT TATGATAAAC AATATCAGAA 3300 

TCTAAATATT CACCGTCATT TTTTAACCAA GCACTAAAAT TTGCCAATTC TTGAATATAT 3360 

TGTTTTTTCG CTCTTTCTAT ATCATAGTTT TCTAAGACGG CGCAATCTTT GATTCTATTT 3420 

TCATAATTTT CTAATATGAT TTTGTAGGAG TCTTTTAGAG GTTTAGCATC TATAACAGGT 3480 

TTATAGATAT ATGTCGGGAA ATTAATATAG GTTGCAGTTT TAGAGTGAAT ATAAAGTCTC 3540 

CAAATAAGGT TGTTTATATC AAATTGATTT ATTTTTCGTA AAAGCTTACT ATTGAATAAT 3600 

TTTCCAAATA ATGAGCGATA TTGTTTTCTA ATTCGATGAT CTGTATCATC CATCTTTTGT 3660 

AAAACTTGAA CATTCGTTAA ATTTTCTGTC AACCAATTAT CCCCCCAAAA AGGATAAAnG 3720 

TAAAATACTC CATCAACCAA ATCAGCAAAA TGACCAAGAA CAACATCAGA ATCGGATAAT 3780 

TTTATCGCAT GATACATCTT TTCAAATGTC CAATCAAATA ATGAATCATT TGAAGATAGA 3840 

AACGTAATAT AATCTCCTGT AATCATATCA GACAACTCAG CAAAAGAATT CTCATCTATA 3900 

ATCTTAATAT TAAATGATAG ATTCATCTGT TGCCTAATGG AAGCTATCTC CTCTGTAGAT 3960 

TGATTTACAA TAATAACTTC TATATCTTTT AATGTTTGTC TCTCCACTAT TGACAAAGAC 4020 

TCTAATAAAC TATTTTTATC TCCTTGATGT AACAAAACAA CACTAATTGA GTAAGTCAGT 4080 

TTGACTACCT CCCATAATTT TCTGATAATG ATTTTCTTTT TATTTAATTA TAGCACAATT 4140

ATGATATATA TCAGGTAATA TCAAGCTATA TTATCTCTTA GCTACTCAAT TTGAAATTTT 4200

AACTTTTCCC TTTTCCGCAA AATAATAGTA TAATAGAGGT AGAATCTAGA ATCGAGGTAC 4260

ACCTATGGCT GTCAAATTTA CAAAACGAGA CGACTTGGAC AAGATGTTTG AAGAGTTTGC 4320

TAAACTCCCT GATTTGAAAC AAGTTACTTT CCCTGATGAC AAAGAGAAAA AAGTCAAAGC 4380

AGAAAAGAAA AACTAGATGA CTGCTTTTCA ACAACTCCCA TCTAGTGTAC TTCAAACTGG 4440

AGCCATTTTT CTCTCCATTA TCATTGAAGC CCTTCCCTTC GTTCTGATAG GAAGCATTGT 4500

CTCAGGGCTG ATTGAAGTTT ATATCACACC TGACAAGGTT TATCATTTTC TCCCTCGAAA 4560

TCGTTGGGGG AGAATCTTTT TTGGGACCTT TGTCGGTATA CTTTTCCCTT CTTGTGAATG 4620

TGGAATCGTC CCCATCATCA ATCGTTTTCT GGAAAAAAAG GTTCCAAGTT ACACGGCCGT 4680

TCCTTTTCTT GTGACAGCAC CTGTTATCAA TCCCATTGTT CTTTTTGCGA CCTATTCTCC 4740

CTTTGGCAAC TCCTTCCATG TCGCCCTATT ACGAGCTCTG GGTTCCATTC TTGTGGCTGT 4800

AATACTACGA ATTTTTCTAG GATTTTTCTG GCAAGAACCG ATTCAGAAAG AAAATCGTCT 4860

GGCTTGTCAT GAGCATGATT TTTCTTACTT GAGTTCTGCA AAAAAAGTTT TTCAAGTCTT 4920

TGTGCAGGCC ATTGATGAAT TTTTTGATAC GGGGCGTTAT TTGGTATTTG GCTGCCTCTT 4980

TGCTTCTATA ATACAGGTCT ACGTTCCGAC TCGGATTCTG ACCTCTATCA GTGCGACCCC 5040

TCTTTTTGCC ATCCTGCTCT TGATGATTTT AGCCTTTCTT CTTTCGCTCT GTAGTGAGGC 5100

GGATGCCTTT ATAGGTGCTT CTCTTCTCTC GAGTTTCGGT TTGGCACCAG TTCTGGCCTT 5160

TCTCGTCATT GGTCCAATGC TGGATATCAA AAATATTCTC ATGATGAAAA ATTACTTGAA 5220

AGCACGATTT ATCAGTCACT TCATAACAAT TGTAACTCTT GTCGTCTTAG TCTATTCTCT 5280

CTTGATTGGA GTTATCCTAT GATTCGATTT TTAGTTTTAG CTGGCTATTT TGAACTGACT 5340

ATTTACCTCC ATCTGTCGGG CAAACTAAAC CAGTACATCA ACATGCACTA TTCCTATCTG 5400

GCCTATATCT CCATGGTGCT TTCTTTTATC TTGGCTATCG TTCAATTGTA TATCTGGATG 5460

AAGCAAGTCA AAACCCACAG TCATCTGAAC AGCCGATTAG CCAAGATAAC GAGTATTTCT 5520

CTTCTGGCTA TTCCACTTGT CATCGGCTTA ACTTTCCCAA CTGTTAGCTT GGATTCTCAG 5580

ACTGTTTCTG CTAAAGGTTA TCATTTCCCC CTATCGGAAG GAACGCATCT AGCCATTCAG 5640

ACAAGCGAAG GGACGACAAG CCAATATTTG AAACCAGATA CCAGTTCTTA TTTTTCAAAA 5700

TCAGCCTATG AAAAGGAAAT GCGAACGGCG GCGGATAAAT ACTTATCCCA AGATAGTATT 5760

CAGATCACTA ATGAAAACTA TATGGAAGTC ATGGAGGCTA TCTACGACTA TCCAGATGAG 5820

TTTGAGGGCA AGACAATCCA GTTTACAGGC TTTGTCTATA ACGACCCCAG TCATGCCAAT 5880

AGTCAATTTC TGTTCCGATT CGGCATTATC CACTGTATCG CAGATTCTGG TGTCTATGGA 5940

TTGCTGACCA AGGGCAATAC CCGGCAGTAT GAAAACAACA CTTGGATAAC AGCCAAAGGA 6000

AAACTGCTCA ATCACTACCA TAAAGAACTC AAACAAAACC TTCCAACCTT GGAAATCGAC 6060

ACCTTTACCA AAGTCGATAA ACCAGAAAAT CCCTATGTAT ATAGAGCTTT TTAAGAAAAT 6120

CAAGATAAAA ACGAACAAGT TCTCTTCTGA ATAACAGAAA AAGAGCCTGT TCGTTTTTTG 6180

TTATATGAAA ATTAGTGACT TGTAGATTTT CATCTTATAC CATTCCCAGC AATACAAGTA 6240

GCTCATAGAA AATAAGCGAG CCACTCATTC ATTAGACTAG CGATTTCTTT AGGTGCTTGA 6300

GTATAAAGCT CATGGCCAAA GTTTTCTAAA AAAATAGTAT CAAAATAGTC TGGCAATTCT 6360

TTTACGGCTT CCTCTCTCCA TGTAGCTTCA TTAGGATAGC GAGGACTAAT AAACAAGCTA 6420

TCTCCCACTT CTCTCTTAAA AGCTTGTATT TTTCTCCGTA GcGGAGTATC CCT Tf*T AT AT 6480

TTTCATAATT TATAGCCAAC TCATATCTAT TATACTCAAC ATTCCAGTGA TAAGACTGTC 6540

TTACAGCTTT CTCCATATTT TCTGACCAAT GCTTTGCTTC AGATTTTTCT TTAGAAGTAA 6600

GAACATCTAA GTCCGAAACA ATTTGAGATT TGATATAATT TTTAGTPTTf^P i w i nsiv, lulu 6660

TATCCAAAGG TAAAATCTTA TCTAAATCTA GATAGCCACC ATCCAAAAGA ATPAtVTTTCT 6720

TTACTTCTTC AAATTCCGAT GCGAAATAAC GAGCTAAATC TCCTCCAAGA GAATCflfPTA 6780

TCAGACAGAT AGATTCTTCC TCTACAATTT CATTTTTAAA CCATGATTTC nflA 4 vAUl 4 A 6840

CATCTCGAAG ATGCTTTTCA TATGGATTTA GAAAATAGAC CTGCGAATCT AGTTCTTGAA 6900

GAAAATCCTT GCTATGATAG GCATTGCTTC CCAAACCGCC AATAAAATAT » 1 * A 1 VJI i 1 V 6960

TCTACTTAAT ACTATGCTTA TTCATCTTTT GTTCAAAGAT AGTTGTGATA ATCTGACGCA 7020

ATTCTTCGCG TTTTTTTTCT GGAATCTCAC CACTTGTTTG AGCTACAGCG TAGAGTTCAG 7080

GGTATTCAAT TGAAATGCGT TTAATCGTAC GTGTTGTAGC ATGTTTTCTG ACAAAAAACG 7140

GGATTCGCTT AATCAAGTCT TGTGGGACTA GCGCCAGAAT CTTCTCAGTA GTTTCTTTGT 7200

CACTAATATT AGACATTGTA AGCCTTTTCT TAATCATTTC CTGTTCTTTT TCTGTAAAAT 7260

CTTTTAATTC CATTCGATTA GTCCTCCTAT TTTCTCTAAG TTAAATTATG TACTAATACA 7320

GATGAAACTA CAAAGAATAA ACTTTAAGAA ATCTTCTCAC TGATAAGATT TTAGCATTAG 7380

ACTTCCTGCG AAACAAAATA TGGTATAGTA GTTCTATGAA TTATGAAGCA AGTAAACAAC 7440

TAACTGATGC ACGATTTAAA cgtcttgttu GTGTTCAGCG CACGACTTTT GAAGAGATAT 7500

TAGCTGTATT AAAAACAGCT TATCAACTTA AACACGCAAA AGGTGGACGA AAACCTAAAT 7560

TAAGCCTAGA AGACCTTCTT ATGGCCACTC TTCAATATGT GCGAGAATAC CGCACTTATG 7620

AAGAAATTGC GGCTGATTTT GGTATTCACG AAAGCAACTT AATCCGTCGG AGCCAATGGG 7680

TTTAAGTAAC TCTTGTTCAA AGTGGTGTTA CGATTTCAAC AACTCCTCTC AGTTCTGAGG 7740

ACACGCTAAT GATTGATAGC CATTCCCATC AATATCGTAT CTTTGGACAT AGCCAATAAA 7800 

TGTTTCATTT TTGCGTGGTT TCTGCCTATT AACGATTGAA ATAACCCACC AACTTATCAA 7860

AAATAGAAAT AAAAATCCTA ACATTACTGT CATATCATAA CACTATTAAA GTTTAACCCA 7920

CTTATCATTA TCCATGATAA AAGGCTTAGC CAGTCCCTCC CCTGTATAAT CCGCATACTT 7980 

OGTGCCCAAA TACTTGTAGC AATCTTCCTT ACTAGCAAAT TTAATCGCTT GGTAGGGCTC 8040 

TTCGAAAGTC AATTTCTCTA CAAATAAGAA ACCGTCATCA GCAGGTACTA AGACCCCAAC 8100 

GTGGCCTACA AACAGATACT CGCCATCCAA ATTGTCGTGC AAGACTACAG ACAGCATTCG 8160 

AGCTTTTTCA TTGAATTGAA ATTGTGAGAA GAATGCTTCC ATCTTTTCAG CGTGAACCTT 8220 

GACATCTGTA GTTGACTCAG TTGGAACTCT CGAAAATAGA ATATCAAACT CTTCCTTATC 8280 

TTGTGAATCA AAGACCTTTC CTTTATCAAT CGCATCATTA TCTAGGAAAA GCAACTGGTC 8340 

ATTCTTTTCA AGCTTTGGAA TGGTCACTGA ATTTTTCAAA AGACAATAAC TATTGATACG 8400 

GCAGTTGGTC CCAACAAAAT CGCCCTTCTT TTGATTCCAG AGATGACTGA TTTTCTCAAC 8460 

ATCGTATTCG GTGTGAGTAA AGGAAGTGAA ATCTCCTGAT AAGCCAGTTG AGCCGACAAT 8520 

GGTATTATAG TCATTAACGA GATTAAAAAA TGCATCAACA CTATTTGGAT CCAAGTGAGC 8580 

TGATAAGAGA GATTTGACCT CTTCTGTACT TACCTGGTTG TTTAGGTTGG TGTATGAAGC 8640 

TTTCCATGGA ACTTTCGCTG AACTGCTTTG CCTTTGATTC GTCCCCTCAG AAGTAGCATG 8700 

TTGTTGTTGA CAAGCAGCCA AGCCTAAAAA CAAGGCTGAA CAGATTCCTA ATGTGGCTAA 8760 

TTTTCTTGAT TTCTTCATTT CTTTCTCCTA AATGTCTTGG ATTAAAGTTT CTTPAACTAT 8820 

TGCTTTACAG ATATTGATTA CTTTCTCATT TAATGTGTTC ATCGTCTTTC CTCCGG 8876 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14736 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

CGCAAACTTT CGCGGTCGGA AGGTAGTTTT ATGACACGAT TTGAGATACG AGATGATTTC 60 

TATCTCGATG GAAAATCATT TAAGATTTTA TCTGGTGCCA TTCATTATTT TAGGGTTCCT 120 

CCAGAGGATT GGTATCATTC GCTCTATAAC TTGAAGGCTC TTGGTTTTAA TACGGTAGAG 180 
ACTTATGTTG CTTGGAATTT ACACGAGCCT TGTGAAGGTG AGTTTCATTT TGAAGGTGAT 240

CTGGATTTAG AGAAATTTCT CCAAATAGCG CAGGATTTGG GTCTCTACGC AATTGTGCGT 300

CCCTCTCCAT TTATCTGTGC GGAATGGGAA TTCGGTGGCT TACCAGCTTG GCTCTTGACC 360

AAGAACATGC GAATTCGCTC ATCCGACCCA CCATATATCG AGGCAGTTGG TCGCTACTAT 420

GATCAGTTAT TGCCAAGACT GGTGCCTCGT TTGTTGGACA ATGGTGGCAA TATTCTCATG 480

ATGCAGGTTG AAAATGAGTA TGGTTCTTAC GGAGAAGATA AGGCTTACCT GACAGCGATT 540

CGACAGCTAA TGGAAGAGTG TGGCGTAACC TGTCCCCTCT TTACATCAGA TGGTCCATGG 600

CGAGCTACTC TCAAAGCTGG AACCTTAATT GAAGAGGACC TCTTTGTAAC AGGAAACTTT 660

GGTTCTAAGG CACCTTACAA CTTTTCGCAG ATGCAGGAAT TCTTTGATGA ACATGGTAAG 720

AAATGGCCAC TCATGTGTAT GGAGTTCTGG GATGGTTGGT TCAATCGCTG GAAAGAACCG 780

ATTATCACAC GGGATCCTAA GGAATTGGCA GATGCAGTTC GAGAGGTTTT GGAACAAGGC 840

TCTATCAATC TTTACATGTT CCACGGTGGT ACAAACTTTG GTTTCATGAA TGGTTGCTCA 900

GCTCGAGGAA CTTTGGACCT GCCACAAGTT ACGTCTTATG ATTACGATGC CCTTCTGGAT 960

GAAGAAGGAA ATCCAACTGC TAAATATCTT GCAGTCAAGA AGATGATGGC AACACATTTT 1020

TCAGAGTATC CGCAGTTGGA ACCACTCTAC AAAGAGAGTA TGGAGTTGGA TGCTATTCCA 1080

CTAGTTGAAA AAGTTTCTTT GTTTGAAACC TTAGATAGCT TGTCAAGTCC TGTAGAAAGT 1140

CTCTATCCTC AAAAGATGGA GGAGCTGGGA CAAAGTTATG GCTACCTACT TTATCGAACA 1200

GAAACAAACT GGGATGCAGA AGAAGAAAGA CTTCGTATCA TTGATGGTCG AGATAGGGCC 1260

CAGCTGTATG TCGATGGTCA GTGGGTTAAA ACTCAATATC AGAGAGAGAT TGGGGAAGAT 1320

ATTTTTTATC AAGGTAAAAA GAAAGGGCTA TCTAGGTTAG ATATCTTGAT AGAAAATATG 1380

GGCCGTGTCA ACTATGGGCA TAAGTTCTTA GCGGATACGC AACGTAAGGG AATTCGGACA 1440

GGGGTCTGTA AGGATCTGCA TTTCTTACTA AACTGGAAAC ACTATCCACT CCCACTAGAC 1500

AATCCTGAGA AAATTGATTT TTCAAAAGGA TGGACTCAAG GACAACCAGC CTTTTACGCT 1560

TATGACTTTA CAGTCGAAGA GCCAAAAGAT ACTTACCTAG ACTTGTCTGA GTTTGGTAAG 1620

GGGGTTGCCT TTGTCAATGG GCAGAATCTA GGACGTTTTT GGAACGTTGG CCCAACTCTC 1680

TCACTTTATA TCCCTCATAG CTATCTCAAG GAAGGTGCCA ACCGCATCAT TATCTTTGAA 1740

ACAGAAGGTC AATATAAAGA AGAGATTCAT TTAACTCGTA AACCTACACT AAAACATATA 1800

AAGGGGGAAA ACTTATGACA ATTGTAGGAT GCCGTATTGA TGGACGTTTG ATCCACGGAC 1860

AAGTAGCCAA TCTTTGGGCT GGAAAACTAA ATGTTTCACG CATTATGGTT GTAGACGACG 1920

AAGTTGTCAA CAACGATATT GAAAAGAGTG GTTTGAAACT TGCGACACCA CCAGGTGTGA 1980

AATTGAGTAT TTTGCCAGTT GAGAAAGCTG CAGCCAATAT TCTTGGTGGC AAATACGATA 2040

GCCAACGTCT CTTTATCGTG GCTCGTAAAC CAGACCGCTT CCTTGGTTTG GTAGAAGCAG 2100 

GTGTACCACT TGAAACCCTT AATGTTGGGA ATATGTCTCA AACACCAGAA ACTCGTTCTA 2160 

TTACACGTTC TATCAACGTA GTAGACAAGG ATGTGGAAGA CTTCCACAAA CTGGCAGAAA 2220 

AAGGTGTTAA ACTTACTGCT CAGATGGTTC CAAATGATCC AATTTCAGAC TTTTTGAGCT 2280 

TATTAAAATA GGAAAAAAAT TTTTAGGAGG TCATTGTTAT GATACAATGG TGGCAAATTT 2340 

TACTTCTCAC TTTGTACTCA GCTTATCAAA TCTGTGATGA GTTGACGATC GTTTCATCTG 2400 

CAGGTTCCCC TGTATTTGCT GGTTTCATTA CTGGTTTAAT CATGGGAGAT GTGACTACTG 2460 

GTTTACTTAT CGGTGGTAAC TTGCAACTGT TCGTTCTTGG GGTTGGTACC TTCGGTGGTG 2520 

CTTCTCGTAT CGACGCAACT TCTGGTGCGG TTCTTGCGAC ACCTTCTCTG TTTCACAAGG 2580 

AATTGATGCA CCGCTTGCCA TTACTACAAT CGCTGTACCA GTAGCAGCTC TCTTGACTTA 2640 

CTTCGACGTT CTTGGTCGTA TGACTACTAC CTTCTTCGCT CACCGTGTGG ATGCTGCAAT 2700 

CGAACGCTTT GACTATAAAG GTATTGAACG CAACTACTTG CTTGGTGCGA TTCCGTGGGC 2760 

TCTATCTCGT GCCCTTCCAG TCTTCTTTGC CCTTGCTTTT GGTGGTGCCT TTGTACAATC 2820 

AGTAGTAGAC TTCGTTGAAG CCTACAAATG GGTTGCAGAT GGCTTGACAC TTGCAGGACG 2880 

TATGCTTCCA GGTCTTGGAT TTGCAATCTT GCTTCGTTAC CTTCCAGTTA AACGTAACCT 2940 

TCACTACCTT GCTATGGGAT TTGGTTTGAC AGCTATGTTG ACTGTTCTTT ACTCATATGT 3000 

AACAGGTCTT GGTGGCGCTG TTGCTGGTAT CGTAGGTACT CTTCCTGCTG AAGTTGCTGA 3060 

AAAAATTGGT TTCGTGAACA ACTTCAAAGG TTTGTCTATG ATTGGTATTT CTATCGTAGG 3120 

TATTTTCCTT GCAGTGCTTC ACTTCAAAAA TAGCCAAAAA GTAGCTGTAG CAGCACCTTC 3180 

TACACCATCA GAAAGTGGGG AAATCGAAGA TGACGAATTC TAATTACAAA CTTACAAAAG 3240 

AAGATTTTAA TCAAATCAAC AAACGTAGCT TGTTTACTTT CCAATTAGGT TGGAACTACG 3300 

AACGTATGCA AGCTTCTGGT TACCTTTACA TGATCTTGCC TCAGTTGCGT AAAATGTATG 3360 

GTGATGGAAC TCCTGAATTG AAAGAAATGA TGAAAGTTCA TACTCAATTC TTCAATACTT 3420 

CACCATTCTT CCATACCATT ATCGCTGGTT TTGACCTTGC CATGGAAGAA AAAGATGGTG 3480 

TAGGTTCAAA AGACGCCGTT AACGGTATCA AGACAGGTTT GATGGGACCA TTCGCTCCTC 3540 

TTGGGGATAC AATCTTTGGT TCACTTGTAC CTGCTATCAT GGGGTCAGTC GCAGCAACTA 3600 

TGGCTATCGC TGGCCAACCT TGGGGGATCT TCCTTTGGAT TGCAGTTGCA GTAGCGTATG 3660 

ACATCTTCCG TTGGAAACAG TTGGAATTTG CTTACAAAGA AGGGGTTAAC CTTATCAACA 3720 
ACATGCAAAG TACCTTGACA GCTTTGATTG ACGCTGCATC TGTACTTGGT GTCTTCATGA 3780

TGGGTGCTCT TGTAGCAACA GTGATTAACT TTGAAATTTC TTACAAGTTG CCAATCGGTG 3840

AAAAGATGAT TGATTTCCAA GACATCTTGA ACCAAATCTT CCCACGTTTG CTTCCAGCAA 3900

TCTTTACTGC CTTTATCTTC TGGTTGCTTG GTAAGAAAGG TATGAACTCT ACTAAAGCTA 3960

TCGGTATTAT TATCGTACTT GCTTTGGCTC TTTCTGCCCT TGGTCACTTT GCACTTGGAA 4020

TGTAATTCCT TATGACTAAA TCATTAATTT TGGTGAGCCA TGGTCGCTTC TGTGAGGAGC 4080

TTAGAGGTAG CACAGAAATG ATTATGGGCC CACAAGACAA CATTTACACA GTAGCTCTTC 4140

TTCCAGAAGA TGGCCCAGAA GAATTTACTG CTAAATTTGA AGCTGTTATT GAAGGATTGG 4200

ATGATTTCCT AGTCTTTGCG GATCTTCTCG GTGGGACACC TTGTAATGTG GTGAGTCGCT 4260

TGATCATGGA AGGTCGTGAT ATTGACCTTT ACGCAGGGAT GAATCTTCCA ATGGTGATTG 4320

AATTTATCAA TGCGAGCCTT ACAGGCGCAG ATGCGGACTA CAAGAGCCGT GCTGCAGAAA 4380

GCATTGTGAA AGTTAATGAC CTGTTAGCGG GCTTCGATGA TGACGAAGAT GAATAATACT 4440

CTTCGAAAAT CTCTTCAAAC TACGTCAACG TCGCCTTGCC GTAGGTATAT GTTACTGACT 4500

TCGTCAGTCT TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTCTTATC 4560

CGGCAACCTC AAAGCAGTGC TTTGAGCAGC CTGCGGCTAG TTTCCTACAG ATTTTAGTTG 4620

GAACTCGATT CAATTCATGT GACAACGTGA AAATCGTTAG AGCATTTTAT ATAGAATATA 4680

CATGGGAATG TAGCTTACTC CCATTCCCAT ATTTAATAGA AAAAGAGGAA CTCAATGCTA 4740

CATTATACAA AAGAAGACTT GCTCGAATTG GGTGCAGAAA TCACTACGCG TGAAATCTAC 4800

CAACAGCCTG ATGTATGGAG AGAAGCTTTT GAATTTTATC AAGCAAAACG TGAAGAAATT 4860

GCAGCCTTCC TACAAGAAAT CGCTGATAAA CATGACTATA TTAAGGTTAT CTTGACAGGT 4920

GCTGGGACTT CTGCTTATGT GGGAGATACC TTGCTACCTT ATTTTAAGGA AGTCTATGAC 4980

GAACGCAAAT GGAATTTCAA TGCTATTGCG ACAACAGATA TCGTTGCCAA TCCAGCAACC 5040

TATTTGAAAA AAGATGTGGC AACTGTCCTP GTGTCTTTTG CTCGTAGTGG GAATTCGCCT 5100

GAAAGTTTGG CGACTGTTGA TTTGGCCAAA TCCTTGGTGG ATGAGCTTTA TCAAGTGACG 5160

ATTACTTGTG CAGCAGATGG TAAATTGGCT CTTCAAGCTC ACGGTGATGA TCGTAATCTC 5220

TTGCTCTTGC AACCAGCTGT CTCTAATGAT GCTGGATTTG CCATGACTTC TAGCTTTACG 5280

TCTATGATGT TGACAACTCT CTTGGTCTTT GATCCTACAG AATTTGCTGT TAAGTCTGAA 5340

CGTTTTGAAG TTGTATCTAG TCTTGCCCGT AAAGTTTTAG ACAAGGCAGA AGATGTCAAA 5400

GAGCTCGTTG ATTTAGACTT TAACCGTGTC ATCTATCTAG GCGCTGGTCC TTTCTTTGGA 5460

CTTGCTCATG AAGCTCAGCT CAAGATTTTG GAATTAACTG CTGGTCAAGT TGCGACCATG 5520

TATGAAAGCC CAGTTGGCTT CCGTCACGGT CCAAAATCTC TTATCAACGA CAATACAGTT 5580

GTTTTGGTCT TTGGTACAAC GACAGACTAC ACTCGTAAGT ACGACTTGGA CTTGGTTCGT 5640 

GAAGTTGCTG GTGACCAGAT TGCTCGTCGT GTTCTGCTTT TGACTGATCA AGCTTTTGGT 5700 

CTTGAAAATG TCAAACAAGT GGCCCTTGGT TGTGGCGGTG TCTTGAATGA TATTTACCGT 5760 

GTCTTCCCTT ACATCGTTTA TGCCCAACTC TTTCCTTTAT TGACTTCACT CAAGGTAGAA 5820

AATAAACCAG ATACACCGTC TCCTACAGGT ACAGTAAACC GTCTAGTACA AGGTGTCATA 5880 

ATTCACGAAT ATCAAAACTA AGACAGTGTT TATGAATTCT TGACAAGAGG ATTTGTAAAT 5940 

TATCAGATAA ACCATAGATT GTCAGTACGC TTTCTATGGT TTGTTTGCTT GAGAGAAATA 6000 

GTAAAAGGAG AACAGAATGA AAGCATACAC AGAGCGTGTA TTTGGAAATG TTGAGGGTGA 6060 

GGATGTCTTG GCCTATCGAT TTGAGACAGA CGGTGGCTAC CAACTTGAGG TTATGACTTA 6120 

TGCTGCGACT ATCTTGCGCT ATGTCGCACC TGACAAGGCT GGAAATTTTG CCAATGTTAT 6180 

CTTGGGATTT GATGACTTTG ATAGTTATGT AGGCAATAGT CCCAAGCATG GAGCAAGTGT 6240 

AGGTCCTGTA GCGGGTCGTA TTGCAGGTGC GACCTTTGAG CTCAATGGTA AGACCTATGA 6300 

CCTTGAGGTT AATAATGCTA GCAACTGTAA TCACAGTGGT TCAACTGGTT GGGATTCCAG 6360 

CTTGTTTGAA GTTGAAGAAG TAAGCGATCA TGGCTTGACT CTCTACACAG AGCGTACAGA 6420 

TGGGACAGGA GGGTTCCCTG GAAATCTCAA GATTTGGATC AGTTATCACT TGGAAGAAAC 6480 

TGGTGCCTAT GAAATCAGCT ACAAGGTAAC GACCGATCAG GATACGCTGG TCAATCCAAC 6540 

CAACCACAGC TATTTCAACT TGTCTGGTGA TTTCACGCAG ACGATTGACC GTCATGTCTT 6600 

CCAACTAAAC ACAGAGGGCA TTTACTCAAT CGCTCCTGAC GGTGTTCCTG CCAAAACTCC 6660 

AGAAGCCAAC CGTGATGTGG TCAAACACGT CTACAATGGT ACCTTGTTGA AGGATATCTT 6720 

TGCAGAAGAA GATGAGCAAA TCCAGCTGGC ATCAGGTTTG GATCATCCAT TTGCCCTTCC 6780 

TGCAGGCCAT GACAATGCTG GATTCCTTTA TGACCAAAAT TCAGGTCGCT TCCTGCTTTT 6840 

CAAGACAGAA GCTCCTTGCT TTGTGGTCTA CACAGCAAAC TTTGTGGATG AAAGTGTCAT 6900 

CATAGGAGGT CAGCCAATGC TACAGCACAA TGGGATTGCT CTTGAAGCGC AAGCTTTACC 6960 

AGATGCCATT CACAGTGACC TTAAAGGCCA AGTCATTCTT AAAGCTGGTC AAACCTTCAC 7020 

CAGTAAGACA CGTTATGAAC TTGTTGTGAA GTAAAAGAGT CATTGCGCCT ACTTTTGGGA 7080 

GCTAGGAATA GGTACGCAGA GACAAATAGT AGGAAAATAT GATATAACTA AGCGTTGAAA 7140 

GCTATCTGTT AATATAATAT TCAAACTACA ATAAGGAGTA AGAAAGAAAC GAAGAAAATT 7200 

GTATTTGCTA GTGCCTTGGC TTTGACCTTG GCTGGAGCAG TTTTGACAAA TGATGTTTTT 7260 
GCGAACGACA GACTTGTGGC AACACAAACT ACTGATGGTA AAAATGAAAA TGTATTGACC 7320

TCAGAGGTGC TAAAACCTTC TAGTGGCAAT GTTTTGGTTG GAATCAAAGG AGAATTTGTG 7380

GCTCCTCATC AACAATCTAT TTTGGATGCC ATCAATGCTA TCTGTAAAGA AGCGGCTGAC 7440 

GAAGGTTTGG TAGATAAGTA TGTCCCTATC AAATGATCAA CTGACCTAGA AAAGGCAGCT 7500 

TTTGCCAGAG CTACAGAAGC ATCTATAACC ATGGATCATA CCCGTCTTTC TAGCAAAGAT 7560 

CTTTGGAGTG CCTTTCCAAC TTCTAATAGT ATAATGGGAG AAAATTTGCC ATGGAATCAT 7620 

GACGGTTTTC TAAAAGCTAT TGAACAATGG CGTGCTGAAA AAGCAGATTA TGTGGAGAAA 7680 

AAAATAGTGG TTCAGACAAC GGCAAATCTG GTCACTATGA GTCGCTAATT AACCCTAAAT 7740 

TTACACACAT GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA 7800 

TTGCTCAAAC TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG 7860

CTGTTCAGTG TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG 7920 

TTGTAGAAAA ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG 7980 

AATCTAATCG TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA 8040 

CAGATGGTAA ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT 8100 

TTTCTGGTAG CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG 8160 

ATGGTAGCAG ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG 8220 

AAAATGGCAC TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG 8280 

TCGGACCACA CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC 8340 

CAGATGGTTA CCGTGTAAAT GGTAATGGTG AATGGGTAAA CTAGGCTCAG GCCATAGGTA 8400 

AAGCATTCAT CTTACTTAGC AAAAAGAATG AACGATAAGA AAGAGGTTGA TGGCGAACAT 8460 

TGGCCTCTTT TGATTTATAA AGATTGGATT CTTGTCGCCT CAATTTCAGA CTTTTCTATT 8520 

GTAAGCTAAT ATTTTATAGC CCATTAAAAG CATAAGCGGT AATCTAATTT AAAAAATGCT 8580

GTAATTAGTC TGAAGTCCAC ACTTACTTGT TGAGATGTTA TCTCTGTTTT TTATCGTTA? 8640 

AATTTACTGT ATTTTTTATA GTATGCAGAA TATTTTTAAG TATATTTCAA TACAAATTTC 8700

TATCGATTTA TTGTATAATG ATAAGTAATT GTTGAAAAGT ACTCAGAAAA TTCCATACTA 8760 

TATTATTTTT ATGTTTATAC TTTTATGCTA TAAAATATAG ATTGATATAA AGAATATAGA 8820 

AAAAGCGAGG TTAATATGAG CCGAAAAAGC ATTGGTGAGA AACGCCATAG TTTCTCGATG 8880 

AGAAAGTTGT CAGTGGGATT GGTATCAGTT ACTGTATCTA GTTTCTTTTT GATGAGTCAA 8940 

GGGATTCAAT CGCTATCGGC CGATAATATG GAAAGTCCAA TTCATTATAA GTATATGACC 9000 

GACGGTAAAT TGACAGACGA GGAAAAATCC TTGCTGGTAG AGGCCCTTCC ACAACTGGCT 9060 
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GAAGAATCAG 


ATGATACTTA 




H*A T Afi &TTTP 


ft ACAnTTTTT 


ACCGAATACA 


9120 


GtiTTTTAACC 


CAACTGTTCG 


TACTTTCCTT


TTTACTGCAG 


fi A1**TYI Aftf^TT 
i i unu^> i • 


\# 4 1 4 4 4 * n 


9180 


TTGGTTTCTA 


AAAGGGAAAA 


TGGAAAGAAA 


CCACTTGTTC 


Ml 1 1 1 \» 1 1 


RTTGACTAGC 


9240 


ATGGGAGTTC 


AATTGTTGCC 


GGCCAGTGCT 


TTTGGGTTGA 


LwnuCvHun l 




9300 



TATAATAGTC 


AGCTTTCTAT 


CGGAGTCGGG 


GAACATTTAC 


LAumAiV 1 \- 1 


fiAAJUVTC&AA 


9360 


GGTTATCAAT 


ATATTGGTTA 


TATCAAAACT 


AAGAAACAGG 






9420 


ACAGTTGATG 


GGAAATACTC 


TGCTCAAAGA 


GATAGTCAAC 


f A & ACTf^* Af* 


AAAAAPATCA 


9480 


GATGTAGTTC 


ATTCAGCTGA 


TTTAGAATGG 


AACCAAGGAC 




*P ft A O A A 
1 flu 111 rt«W 


9540 


GGTGAAGCAT 


CAGGGGATGA TGGACTTTCA GAAAAATCTT 


Llnl AGC AU4. 


Af*APAA1*tf*TA 


9600 



TCTTCTAATG 


ATTCATTCGC 


AAGTCAAGTT 


GAGCAGAATC 


CAiGATGAl_AA 


& W A r* A A*IV**P 


9660 



GTAGTTCGAC 


CAACAGTGCC 


AGAACAAGGA 


AATCCTGTGT 


CTGCTACAAC 


GG rbUUjAVi 1 


9720


GCGGAAGAGG 


AAGTATTGGC 


GACGACAAAT 


GATCGACCAG 


AGTATAAACT 




9780


ACCAAAGGCA 


CGCAAGAACC 


CGGTCATGAG 


GGTGAAGCCG 


CAGTCCGTGA 


AGAlJTTA\.l-A 




GTCTACACTA 


AGCCACTAGA 


AACCAAAGGT 


ACACAAGGAC 


CCGGACATGA 


AGGTGAAGl, 1 




GCAGTTCGCG 


AGGAAGAACC 


AGCTTACACA GAACCGTTAG 


CAACGAAAGG 


CALGCAAuAu 




CCAGGTCATG 


AGGGCAAAGC 


TACAGTCCGC 


GAAGAGACTC 


TAGAGTACAC 


GGAA(_v~v~vii A 


10020


GCGACAAAAG 


GCACACAAGA ACCCGAACAT GAGGGCGAAG


CGGCAGTAGA 


AuAAuAAL 1 1 


10080


CCGGCTTTAG 


AGGTCACTAC 


ACGAAATAGA 


ACGGAAATCC 


AGAATAI 


1 TATnLnnl. A. 




GAAGAAATTC 


AGGATCCAAC 


ACTTCTGAAA 


AATCGTCGTA 


AGATTGAACG 


iri A/VWAA 


10200 


GCAGGGACAC 


GTACAATTCA 


ATATGAAGAC 


TACATCGTAA 


ATGGTAATGT 


LAj 1 ALaAAAl- 1 


10260


AAAGAAGTGT 


CACGAACTGA 


AGTAGCTCCG 


GTCAACGAAG 


TCGM I AAAG1 


ft/V"ft ir , ft*"*ft* 


10320 



GTGAAAGTTA AACCTACAGT AGAAATTACA AACTTAACAA AAGTTGAGAA CAAAAAATCT 10380

ATAACTGTAA GTTATAACTT AATAGACACT ACCTCAGCAT ATGTTTCTGC AAAAACGCAA 10440

GTTTTCCATG GAGACAAGCT AGTTAAAGAG GTGGATATAG AAAATCCTGC CAAAGAGCAA 10500

GTAATATCAG GTTTAGATTA CTACACACCG TATACAGTTA AAACACACCT AACTTATAAT 10560

TTGGGTGAAA ATAATGAGGA AAATACTGAA ACATCAACTC AAGATTTCCA ATTAGAGTAT 10620

AAGAAAATAG AGATTAAAGA TATTGATTCA GTAGAATTAT ACGGTAAAGA AAATGATCGT 10680

TATCGTAGAT ATTTAAGTCT AAGTGAAGCG CCGACTGATA CGGCTAAATA CTTTGTAAAA 10740

GTGAAATCAG ATCGCTTCAA AGAAATGTAC CTACCTGTAA AATCTATTAC AGAAAATACG 10800

CATGGAACGT ATAAAGTCAC GGTAGCCGTT GATCAACTTG TCGAAGAAGG TACAGACGGT 10860

TACAAAGATG ATTACACATT TACTGTAGCT AAATCTAAAG CAGAGCAACC AGGAGTTTAC 10920 

ACATCCTTTA AACAGCTGGT AACAGCCATG CAAAGCAATC TGTCTGGTGT CTATACATTG 10980 

GCTTCAGATA TGACCGCAGA TGAGGTGAGC TTAGGCGATA AGCAGACAAG TTATCTCACA 11040 

GGTGCATTTA CAGGGAGCTT GATCGGTTCT GATGGAACAA AATCGTATGC CATTTATGAT 11100 

TTGAAGAAAC CATTATTTGA TACATTAAAT GGTGCTACAG TTAGAGATTT GGATATTAAA 11160 

ACTGTTTCTG CTGATAGTAA AGAAAATGTC CCAGCGCTGG CGAAGGCAGC GAATAGCGCG 11220 

AATATTAATA ATGTTGCAGT AGAAGGAAAA ATCTCAGGTG CGAAATCTGT TGCGGGATTA 11280 

GTAGCGAGCG CAACAAATAC AGTGATAGAA AACAGCTCGT TTACAGGGAA ACTTATCGCA 11340 

AATCACCAGG ACAGTAATAA AAATGATACT GGAGGAATAG TAGGTAATAT AACAGGAAAT 11400 

AGTTCGAGAG TTAATAAAGT TAGGGTAGAT GCCTTAATCT CTACTAATGC ACGCAATAAT 11460

AACCAAACAG CTGGAGGGAT AGTAGGTAGA TTAGAAAATG GTGCATTGAT ATCTAATTCG 11520 

CTTGCTACTG GAGAAATACG AAATGGTCAA GGATATTCTA GAGTCGGAGG AATAGTAGGA 11580 

TCTACGTGGC AAAACGGTCG AGTAAATAAT GTTGTGAGTA ACGTAGATGT TGGAGATGGT 11640 

TATGTTATCA CCGGTGATCA ATACGCAGCA GCAGATGTGA AAAATGCAAG TACATCAGTT 11700 

GATAATAGAA AAGCAGACAG ATTCGCTACA AAATTATCAA AAGACCAAAT AGACGCGAAA 11760 

GTTGCTGATT ATGGAATCAC AGTAACTCTT GATGATACTG GGCAAGATTT AAAACGTAAT 11820 

CTAAGAGAAG TTGATTATAC AAGACTAAAT AAAGCAGAAG CTGAAAGAAA AGTAGCTTAT 11880 

AGCAACATAG AAAAACTGAT GCCATTCTAC AATAAAGACC TAGTAGTTCA CTATGGTAAC 11940 

AAAGTAGCGA CAACAGATAA ACTTTACACT ACAGAATTGT TAGATGTTGT GCCGATGAAA 12000 

GATGATGAAG TAGTAACGGA TATTAATAAT AAGAAAAATT CAATAAATAA AGTTATGTTA 12060 

CATTTCAAAG ATAATACAGT AGAATACCTA GATGTAAGAT TCAAAGAAAA CTTCATAAAC 12120 

AGTCAAGTAA TCGAATACAA TGTTACAGGA AAAGAATATA TATTCACACC AGAAGCATTT 12180 

GTTTCAGACT ATACAGCGAT AACGAATAAC GTACTAAGCG ACTTGCAAAA TGTAACACTT 12240 

AACTCAGAAG CTACTAAAAA AGTACTAGGA GCAGCGAATG ATGCAGCCTT AGATAACCTA 12300 

TACTTAGATA GACAATTTGA AGAAGTTAAA GCTAATATAG CAGAACACCT AAGAAAAGTA 12360 

TTAGCGATGG ATAAATCAAT CAATACTACA GGAGACGGTG TAGTTGAATA CGTAAGTGAG 12420 

AAAATCAAAA ATAACAAAGA AGCATTTATG CTAGGTCTTA CTTATATGAA CCGTTGGTAC 12480 

GATATTAATT ATGGTAAAAT GAATACAAAA GATTTATCTA CGTACAAGTT TGACTTTAAC 12540 

GGAAATAATG AGACTTCAAC GTTGGATACT ATTGTCGCAT TAGGAAATAG TGGACTAGAT 12600 
AACCTGAGAG CTTCAAATAC TGTAGGTTTA TATGCGAATA AACTTGCATC GGTAAAAGGA 12660

GAAGATTCAG TCTTTGACTT CGTAGAAGCG TATAGAAAAC TGTTCTTACC AAACAAAACA 12720 

AATAACGAGT GGTTTAAAGA AAATACAAAG GCATATATAG TCGAAATGAA GTCTCATATT 12780 

GCAGAAGTAC GAGAAAAACA AGAATCACCA ACAGCCGATA GAAAATATTC ATTAGGACTT 12840

TACGATAGAA TATCAGCACC AAGTTGGGGG CATAAGAGTA TGTTATTACC ACTACTAACT 12900 

TTACCTGAAG AATCTGTGTA TATTTCATCG AATATGTCTA CACTTGCATT CGGTTCGTAT 12960 

GAAAGATATC GTGATAGTGT GGATGGAGTT ATTCTTTCAG GAGATGCTTT ACGAACTTAT 13020 

GTAAGAAATA GAGTTGATAT AGCAGCGAAA AGGCATAGAG ACCATTATGA TATTTGGTAC 13080 

AATCTTCTTG ACAGTGCTTC AAAAGAAAAA CTTTTCCGTT CTGTGATAGT TTATGATGGA 13140 

TTCAATGTAA AAGATGAGAC AGGAAGAACT TATTGGGCAA GGTTAACGGA TAAAAACATC 13200 

GGCTCTATTA AAGAATTCTT CGGACCTGTT GGGAAATGGT ATGAGTATAA TAGTAGTGCA 13260 

GGAGCGTATC CGyAtGGAAG TTTAACGCAC TTTGTGTTAG ATAGATTATT AGATGCTTAT 13320

GGAACGTCGG TTTATACTCA TGAAATGGTT CATAATTCTG ATTCTGCAAT CTACTTTGAA 13380 

GGAAATGGTA GACGTGAAGG ATTGGGAGCG GAGTTATACG CACTTGGTTT ACTGCAATCT 13440 

GTAGATAGTG TAAATTCTCA TATTTTAGCT TTAAATACGT TATATAAAGC AGAAAAAGAT 13500 

GATTTGAATA GATTGCATAC ATATAATCCG GTCGAACGTT TCGATTCGGA TGAGGCGCTT 13560 

CAAAGTTATA TGCATGGATC ATATGATGTA ATGTATACAC TTGATGCGAT GGAAGCAAAA 13620 

GCGATATTAG CTCAAAATAA TGATGTTAAG AAAAAATGGT TTAGAAAAAT AGAAAATTAT 13680 

TACGTTCGTG ATACTAGACA TAATAAAGAT ACACATGCAG GAAATAAAGT CCGTCCATTA 13740 

ACAGATGAAG AAGTAGCTAA CTTAACATCG TTAAACTCAT TAATCGACAA CGACATCATA 13800 

AATAGACGTA CCTATGATCA TAGTAGAGAA TATAAACGAA ATGGCTACTA TACTATAACT 13860 

ATGTTCTCTC CTGTATACGC AGCGCTAAGC AATTCGAAAG GTGCTCCTGG AGATATTATG 13920 

TTTAGAAAAA TAGCTTATGA ATTACTTGCG GAAAAAGGTT ATCACAAAGG ATTCCTACCT 13980 

TATGTTTCTA ATCAGTACGG AGCAGAAGCA TTTGCCAGCG GAAGCAAAAC ATTCTCATCA 14040 

TGGCATGGAA GAGATGTTGC TTTAGTGACA GATGATTTAG TATTTAAGAA AGTATTCAAT 14100 

CGTGAGTACT CATCATGGGC TGATTTCAAA AAAGCAATGT TTAAACAACG TATAGATAAA 14160 

CAAGATAATC TGAAACCAAT AACAATTCAA TACGAATTAG GTAATCCTAA TAGTACAAAA 14220 

GAAGTAACTA TAACAACGGC TGCACAAATG CAACAATTAA TTAATGAAGC GGCTGCGAAA 14280 

GATATTACTA ATATAGATCG TGCAACGAGT CATACCCCAG CAAGTTGGGT GCATTTATTA 14340 
AAACAAAAAA TCTATAATGC ATATCTTCGC ACTACAGATG ACTTTAGAAA TTCTATATAT 14400

AAATAAGATT GTAGAGTTTC ATTGTTGAGT AGTGTTTCTT GTAAGGATGA GGAGTCAGAT 14460 

GACAAATCGA CTCCTTTTTC TTATGGATCG ATGTAGAGAT TTGATTGAAT GCAGATTGCA 14520 

GGAATCATCT TCAACTCATC AACGACCAAT GGTGACAAGG TGGATTTCAA TCCCACAGAA 14580 

AATGTTGATT TGAGAAATAA CTTTGCTAGT CTAGTAAAAT AAATACAAAA CAATCCTAGA 14640 

AGATTTTTTC TGGGATTGTT TTTTGCTGAG TGGGATGCTT CAAGTTGTCT GGCTTGACTT 14700 

TCTTGAGGGA AGTTATATAA TAGTTGTAAT AATTAG 14736 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11770 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



ACAGGAAAGC ACGATAGCAA TCTCTTTGGA AGATTTAAAA AATATTCCTC AAAGTTTCGC 60

TGTTGCTTAC GGTGATACGA AAGTATCTTC GATTCTCTCT GTCTTGCGTG CTAATTTAGT 120

AAATCATTTG ATTACAGACA AAAATACAAT TTTAAAAGTT TTGGAAGAAG ATGGGGATTT 180

GACTTTTAGA GAGATTCTAG GTGAGTGAAA ATGATAGACT GATTCAGTTT ATCUTTTTTC 240

TTTTTAGTTG ATTGCACATT TGTGCTTATA TAAACAAAAA TAGTTTATCT GTTGTTTTTG 300

GATTGACAAC TTTATTATGT AGTTGTATTC TATAGTTACA AAAGAAAATT TTAAAATTTC 360

AAATGAAAAA AGCTTTTTAC ATAGTGAAAT GAGGAGGAAT TTATGGAAAT GATTGTTCCA 420

GATCAAATTA TCATGGGTTT AATTTTATAT GCTGGTGATG CGAAACAACA TATTTATAAA 480

GCGTTAGATT ACATAAAAAA TGGTACATGT GAACGGTGTG AAGAAGAAAT ACAGTTAGCT 540

GATGCAGCCT TATTAGAAGC TCATAATCTA CAAACAAAAT TTTTGGCACA GGAAGCGTCT 600

GGTACAAAGA CAGAAATTAC AGCTCTCTTT GTTCATTCAC AAGATCATCT CATGACCAGT 660

ATGACGGAGA TTAATTTAAT CAAAGAAATT ATTAGTTTGA GAAAAGAACT TCATAAAAAA 720

TAATACTAGA GTATTATCAT TGTTATTAAC ATAGAGGAGG AAAACATAAT GGTGAAGATT 780




GTGCAGCAGG 


TTTTTCTACT 


GGTATGCTTG 


TAAATAATAT 


GAAAATTGCA 


840 


GCGCAATCTA GTGGAGTTGA GGCAGAAATA GAGGCGTTTP CTCAGTCTAA ATTAGCGGAT 900

TATGCGCCAA ATATAGATGT TGCACTATTG GGTCCACAAG TTGCTTATAC ATTAGATAAA 960

TCAAAAGAAA TTTGTGATAA GTGTGATGTT CCGATAGCTG TTATTCCGAT GATGGACTAT 1020

GGTATGTTAG ATGGGAAAAA AGTATTAGAT TTGGCCCTAT CTTTGATTAG TGGGTAAGAA 1080

AAGGAGATTT ATTATGTCAA AGATGGATGT TCAGAAAATC ATTGCACCGA TGATGAAGTT 1140

TGTGAATATG CGTGGCATTA TAGCTCTAAA AGATGGGATG TTAGCAATTT TGCCATTGAC 1200

AGTAGTTGGT AUTTTGTTCT TGATTATGGG ACAATTGCCG TTCGAAGGAT TAAATAAGAG 1260

CATTGCTAGT gtttttguag CTAATTGGAC AGAGCCGTTT ATGCAAGTAT ATTCAGGAAC 1320

TTTTGCTATT ATGGGTCTAA TTTCTTGTTT TTCAATTGCC TATTCTTATG CTAAGAATAG 1380

CGGAGTAGAG GCTTTACCAG CTGGAGTTCT ATCTGTATCT GCATTCTTTA TTTTGCTAAG 1440

ATCATCTTAT ATCCCTAAAC AAGGTGAGGC GATTGGGGAC GCTATTAGTA AAGTTTGGTT 1500

TGGAGGCCAA GGAATTATCG GTGCTATCAT TATAGGTTTG GTAGTAGGAA GTATTTATAC 1560

CTTCTTTATA AAGAGAAAAA TTGTTATTAA GATGCCAGAA CAAGTTCCAC AAGCTATTGC 1620

CAAACAGTTT GAAGCAATGA TTCCAGCATT TGTAATTTTC TTATCTTCTA TGATTGTATA 1680

TATTTTAGCG AAGTCATTGA CTAATGGCGG AACATTCATA GAAATGATTT ATTCTGCTAT 1740

TCAAGTTCCG TTGCAAGGTT TAACTGGATC TTTGTATGGT GCTATTGGAA TTGCATTCTT 1800

TATATCATTT TTGTGGTGGT TTGGTGTTCA TGGGCAATCG GTAGTAAATG GAGTAGTGAC 1860

AGCTCTGCTT TTATCTAATC TTGATGCTAA TAAAGCTATG TTAGCCTCTG CTAATCTATC 1920

ATTAGAAAAT GGTGCACATA TTGTTACTCA ACAATTTTTA GATTCATTTT TAATTCTATC 1980

AGCTTCAGGG ATTACGTTTG GTCTTGTAGT TGCCATGCTT TTTGCAGCAA AATCAAAACA 2040

ATACCAAGCC TTAGGAAAAG TTGCAGCTTT TCCAGCAATA TTTAACGTAA ATGAGCCAGT 2100

TGTATTTGGA TTTCCGATTG TCATGAATCC AGTTATGTTT GTACCTTTCA TTCTTGTTCC 2160

TGTACTTGCA GCTGTGATAG TATATGGAGC TATTGCAACA GGTTTCATGC AGCCATTCTC 2220

AGGGGTAACA TTGCCTTGGA GTACACCAGC TATTTTATCA GGATTTTTGG TGGGTGGATG 2280

GCAAGGAGTT ATTACTCAGC TGGTGATATT AGCGATGTCT ACATTGGTTT ATTTTCCATT 2340

CTTTAAAGTA CAGGATCGTT TAGCTTACCA AAATGAAATC AAACAATCTT AGAGGTATTT 2400


GTGTGTTACT 


GTTAAACTCA 


CACATTTGTB 


LTAAAAAi l A 


GAGAGTTAAA 


ATTTTTCTAG


2460 


TTAAAAGCTT GAAAATTTCT ATAAAAATCG GTATTATATT TTCGAAAGAA ATAAAAATAT 2520

TTTCGAAAGA AAGGTGCTTA CGATGGTAAA TACAGAAGTA GCAAGAACAA CAATCAAGAC 2580

AGAATATTTT GGCAGCCTTA CTGAAAGGAT GAACAAATAT CGAGAAGATG TTTTAAATAA 2640

AAAACCTTAT ATTGATGCTG AGAGAGCAGT TCTAGCAACA CGCGCCTATG AACGATACAA 2700

GGAACAACCT AATGTCCTAA AACGTGCATA TATGCTGAAA GAAATTTTGG AAAATATGAC 2760

TATCTATATT GAAGAAGAAT CTATGATTGC GGGAAATCAA CCTTCTTCCA ATAAAGATGC 2820

TCCTATTTTT CCGGAATATA CGCTAGAATT TGTTCTCAAT GAGTTGGATC TTTTTGAAAA 2880 

GCGTGATGGA GATGTTTTCT ATATTACAGA AGAAACAAAA GAACAACTTA GAAGTATTGC 2940 

TCCGTTTTGG GAAAATAATA ATTTACGTGC TAGAGCTGGT GCCTTATTAC CTGAAGAAGT 3000 

GTCTGTTTAT ATGGAAACAG GATTCTTCGG TATGGAAGGT AAGATGAATT CTGGAGATGC 3060 

TCACTTAGCA GTTAACTATC AGAAACTTTT GCAATTTGGT TTAAGAGGTT TTGAAGAGCG 3120 

GGCTCGTAAA GCAAAAGTAG CTCTAGATTT AACAGATCCA GCAAGTATTG ATAAATATCA 3180 

TTTTTACGAC TCTATATTTA TCGTAATCGA TGCTATTAAA GTATATGCAA AGCGCTTTGT 3240 

TGCTCTTGCT AAAAGTTTAG CCGAAAATGC AAATCCTAAA CGTAAGAAAG AATTACTTGA 3300 

GATTGCAGAT ATTTGCTCTA GAGTCCCATA TGAACCGGCA ACTACTTTTG CAGAAGCTAT 3360 

TCAATCAGTT TGGTTTATTC AATGTATTTT ACAAATTGAA TCTAATGGCC ACTCTCTTTC 3420 

ATATGGCCGT TTTGATCAAT ATATGTATCC ATATATGAAG GCTGATTTAG AAAGTGGTAA 3480 

AGAAACAGAA GATAGCATTG TTGAACGTCT GACAAATCTT TGGATTAAGA CAATTACAAT 3540 

TAATAAGGTT CGCAGTCAAT CACATACATT TTCTTCAGCA GGAAGTCCTT TATATCAAAA 3600 

TGTTACAATT GGTGGACAGA CTCGAGATAA GAAGGATGCT GTTAACCCAT TATCTTATTT 3660 

GGTATTAAAA TCAGTTGCAC AAACCCATCT ACCGCAACCT AATCTAACTG TACGTTACCA 3720 

TGCAGGTTTA GATGCTCGTT TCATGAATGA GTGTATTGAA GTGATGAAAC TTGGTTTTGG 3780 

TATGCCTGCA TTTAATAATG ATGAGATTAT TATTCCTTCT TTTATTGCAA AAGGAGTATT 3840 

GGAAGATGAT GCTTATGATT ACAGTGCCAT TGGATGTGTT GAAACGGCAG TTCCAGGGAA 3900 

ATGGGGCTAT CGTTGCACAG GTATGAGTTA TATGAACTTC CCTAAGGTTC TACTTATCAC 3960 

GATGAATGAT GGAATTGATC CGGCTTCGGG TAAACGGTTT GCACCAAGCT TTGGTCGTTT 4020 

TAAGGATATG AAGAACTTTT CTGAATTAGA AAATGCTTGG GATAAAACAC TAAGATATTT 4080 

GACACGAATG AGTGTTATTG TTGAAAATTC TATTGATTTA TCATTGGAAC GAGAAGTTCC 4140 

TGATATTCTA TGTTCAGCAT TGACTGATGA TTGTATTGGT CGTGGAAAAC ACCTTAAAGA 4200 

AGGTGGAGCA GTATATGATT ATATATCAGG ATTGCAAGTT GGAATTGCAA ATTTGTCGGA 4260 

TTCATTAGCT GCAATTAAAA AATTGGTGTT TGAGGAAGAA CGTATAAGCC CAAGTCAGCT 4320 

TTGGCATGCA CTGGAAACAG ATTATGCCGG AGAAGAAGGT AAGGTCATTC AAGAAATGTT 4380 

GATTCATGAT GCACCTAAGT ATGGTAATGA TGATGATTAT CCTGACAAAT TGGTTACTGC 4440 

TGCTTATGAC ATTTATGTTG ATGAAATTGC TAAATATCCT AATACACGTT ATGGAAGAGG 4500 

GCCTATTGGA GGAATTCGTT ATTCAGGAAC ATCTTCTATC TCAGCCAACG TAGGGCAGGG 4560 
ACGTGGAACA TTAGCAACTC CAGATGGACG CAACGCGGGT ACACCGTTAG CAGAGGGTTG 4620

TTCACCATCA CATAATATGG ATCAACACGG CCCTACATCT GTTTTAAAAT CTGTTTCAAA 4680

ATTACCAACA GATGAAATCG TAGGTGGGGT TCTCTTAAAT CAGAAAGTAA ATCCTCAAAC 4740

GTTAGCCAAA GAAGAAGATA AATTAAAACT AATTGCTTTG TTACGAACAT TCTTTAATCG 4800

TTTACATGGG TACCATATTC AATACAATGT TGTTTCCAGA GAGACGCTGA TTGACGCTCA 4860

GAAACATCCT GAAAAACACA GAGACTTAAT TGTTCGTGTT GCAGGATACT CTGCATTCTT 4920

CAATGTTCTT TCTAAGGCAA CCCAAGATGA CATTATAGGA CGTACTGAGC ATACTTTGTA 4980

AAATAAAGAG GTTCTTTTTA TGGAATTTAT GCTTGACACA TTAAATTTAG ATGAGATTAA 5040

AAAGTGGTCT GAAATTTTGC CGCTAGCTGG GGTAACTTCA AATCCCACTA TTGCAAAAAG 5100

AGAGGGTTCT ATTAATTTTT TTGAACGAAT CAAAGATGTA AGAGAATTGA TTGGCTCTAC 5160

ACCCTCTATT CATGTTCAGG TGATTTCTCA AGATTTTGAA GGCATCTTAA AGGATGCTCA 5220

TAAAATTCGA AGACAAGCAG GAGATGATAT ATTTATCAAA GTACCTGTTA CTCCAGCTGG 5280

ATTACGTGCA ATAAAGGCGC TAAAAAAAGA GGGCTACCAT ATCACTGCAA CAGCTATTTA 5340

TACAGTTATT CAGGGATTAT TAGCTATCGA AGCAGGAGCG GATTACCTAG CTCCATATTA 5400

TAATAGAATG GAAAATCTGA ACATTGATTC AAATTCTGTC ATTCGTCAAT TAGCTCTTGC 5460

TATTGATAGA CACAACTCTC CTAGTAAGAT TTTAGCTGCA TCCTTTAAAA ATGTAGCACA 5520

AGTAAATAAT GCTTTAGCTG CAGGTGCGCA TGCTGTTACA GCAGGAGCGG ATorrriTGA 5580

ATCAGCTTTC GCCATGCCAT CTATCCAAAA GGCGGTTGAT GATTTTTCTG ACGATTGGTT 5640

TGTTATTCAA AATAGTCGTT CCATTTAGAT AGAGAGGAAA TACATATGAG AATTTTTGCT 5700

AGTCCTTCTA GATATATTCA GGGGGAAAAT GCCTTGTTTG AAAATGCCAA ATCAATTTTG 5760

GATTTGGGAA ATTGCCCTAT TCTATTATGC GATCAGTTGG TTTATGATAT TGTTGGAAAA 5820

CGATTTGAAG ATTACCTACA TAGGTATGGT TTCCATATTG TTCTGGCGCT ATTTAATGGT 5880

GAAGCTTCTG ACAATGAAAT CAATCGAGTT GTTGCCTTGG CTGAGAAAGA AAATTGTGAT 5940

AGTATTATCG GTCTTGGTGG GGGAAAGACG ATTGATAGCG CAAAAGCTAT TGCAGATTTG 6000

ATTGAAAAGC CTGTTATTAT TGCTCCAACA ATTGCATCGA CCGACGCACC TGTATCTGCT 6060

TTATCTGTTA TTTATACAGA TGAAGGTGCA TTTGATCATT ATCTATTTTA TTCTAAAAAT 6120

CCAGATTTAG TTTTGGTTGA TACAAAAGTT ATTTCACAAG CCCCTAAGCG TTTATTAGCG 6180

TCTGGTATTG CAGATGGTTT AGCAACTTGG GTTGAGGCGC GTGCGGTTAT GCAGGCAAAT 6240

GGAAAAACTA TGTTGGGACA ACAGCAAACA TTGGCTGGAG TTGCAATTGC GAAGAAATGT 6300
GAAGAAACGC TGTTTGCAGA TGGTTTACAG GCTATGGCAG CTTGTGAAGC TAAAGTGGTG
ACACCAGCAT TAGAAAATAT TGTTGAAGCT AATACTTTAT TGAGTGGTCT AGGTTTTGAA 
AGTGGAGGAT TAGCTGCGGC GCATGCAATT CATAATGGTT TTACTGCATT GACAGGTGAC 
ATTCATCATT TAACACATGG TGAAAAAGTA GCTTATGGAA CTTTAGTACA ACTATTATTG 
GAAAATAGAC CTAAAGAAGA ACTTGATAAG TATATTGAGT TTTACAAAAA AATTGGTATG 
CCAACAACTC TAAAAGAAAT GCATTTGGAT CAAGTTGGAT ATGATGATTT AATAAAAGTT 
GGTAAACAAC CAACTATGGA GGGTGAGACA ATTCATCAGA TGCCGTTTAA GATTTCGCCT 
TCAGATGTTG CTCAAGCTAT TATCGCTGTA GATGCCTATG TAAATTCAAA ATAAACAATA 
AGGACTACTG TTTTCCAAAT GGTAGTCTTT TATTGATCCC TGTATTGAAT TCTATAGAAG 
ATTGAAATAG GATGAGAACA AATCGATTGC GAAAGTAAAA TTAATTTCTA TAAATGTTTT 
AGCAATTGTT TCGTACTATT TCAGATTCAG TCTACTATAT GTTCTTCATA AATCAAAAAG 
CGACATAGGT TGTCGGCTAT TTATTGTGAA TACATTAATT AGCATTCCAG TTTTATCTTC 
GGTCTAAAAT AAGTATTTTG TGCTATACGA GATAAGCTTC TTGACTTACT CCTTGATTTA 



CTGCATAACA 

AGATACAGTA 

AGTAACATCA 

TAAGTGGCAG 

ACCTAAGTTT 

ACACCCAGAA 

CAATGTCCTT 

GGATACTGGT 

AATTAATGCG 

CTACTACAAG 

AGCTGAAGTG 

GCTTCCTGAC 

ATGGATGCTC 

TTGGTCAGAG 

TGTAACTTTC 

CACACTTTTC 

CACAAGTTCA 



ATGGGATAAA 

TACAGTTTTT 

CAAAATTGGA 

GGCTACTGGG 

TATGCGCTTG 

GGTTATACTG 

CACCCAATGG 

AATGACCCAG 

CTTGGATTTT 

TGGACTCAAT 

CCAGTAAACT 

GGAACTTCTG 

AAAATCACGG 

TCTATCAAGG 

AAAGTAAAAG 

GGTGCGACTT 

GAGCAAGCAG 



AAGTGGGAGA TAGAGCAATT CATAGTCATC AAAATTAATG 
CCTTTAAACA CATTTCAAAT TCCCTCAAAA ATGGTATAAT 
GAGAGACCAT GAGTTTTTAC AATCATAAAG AAATTGAGCC 
CAGAACATCA TACATTTAAG ACAGGAACAG ATACATCAAA 
ATATGTTCCC TTATCCGTCT GGAGCTGGTC TGCACGTAGG 
CAACCGATAT CCTCAGTCGT TACAAACGTG CGCAAGGCTA 
GTTGGGATGC TTTTGGTTTG CCTGCAGAGC AATACGCTAT 
CAGAATTTAC AGCGGAAAAC ATTGCCAACT TCAAACGTCA 
CTTATGACTG GGATCGTGAA GTCAACACAA CAGATCCAAA 
GGATTTTCAC CAAGCTTTAC GAAAAAGGCT TGGCCTATCA 
GGGTTGAGGA ATTGGGAACT GCCATTGCCA ATGAAGAAGT 
AGCGTGGAGG CTATCCAGTT GTCCGCAAAC CAATGCGCCA 
CTTACGCAGA GCGCTTGCTC AATGACTTAG ATGAACTAGA 
ATATGCAACG CAACTGGATT GGTAAATCAA CTGGTGCCAA 
GAACAGACAA GGAATTTACA GTCTTTACTA CTCGTCCGGA 
TCACTGTCTT GGCTCCTGAA CATGAATTAG TAGACGCTAT 
AAGCTGTAGC AGACTATAAA CACCAAGCCA GCCTTAAGTC 



6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 
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TGACTTGGCT CGTACAGACC TTGCTAAAGA AAAAACAGGG GTTTGGACTG GTGCTTATGC 8160 

CATCAACCCT GTCAATGGTA AGGAAATGCC AATCTGGATT GCAGACTATC TCCTTGCTAG 8220 

TTATGGAACA GGTGCGGTTA TGGCTGTGCC TGCCCACGAC CAACGTGACT GGGAATTTGC 8280 

CAAACAATTT GACCTTCCAA TCGTCGAAGT ACTTGAAGGT GGAAATGTCG AAGAAGCTGC 8340 

CTACACAGAG GATGGCCTGC ATGTCAATTC AGACTTCCTA GATGGATTGA ACAAAGAAGA 8400 

CGCTATTGCC AAGATTGTGG CTTGGTTGGA AGAAAAAGGC TGTGGTCAGG AGAAGGTTAC 8460 

CTACCGTCTC CGCGACTGGC TCTTTAGCCG TCAACGTTAC TGGGGTGAGC CAATTCCAAT 8520 

CATTCATTGG GAAGATGGAA CTTCAACAGC TGTTCCTGAA ACTGAATTGC CGCTTGTCTT 8580 

GCCTGTAACC AAGGATATCC GTCCTTCAGG TACTGGTGAA AGTCCACTAG CTAACTTGAC 8640 

AGATTGGCTT GAAGTGACTC GTGAAGATGG TGTCAAAGGT CGTCGTGAAA CCAACACTAT 8700 

GCCACAATGG GCTGGTTCAA GCTGGTACTA CCTCCGCTAT ATTGACCCGC ACAATACTGA 8760 

GAAATTGGCT GATGAGGACC TCCTCAAACA ATGGTTGCCA GTAGATATCT ACGTGGGTGG 8820 

TGCGGAACAT GCTGTACTTC ACTTGCTTTA TGCTCGTTTC TGGCATAAAT TCCTCTATGA 8880 

CCTCGGTGTT GTTCCGACTA AGGAACCATT CCAAAAACTC TTTAACCAAG GGATGATTTT 8940 

GGGAACAAGC TACCGTGACC ACCGTGGTGC TCTTGTGGCA ACCGACAAGG TTGAAAAACG 9000 

TGATGGTTCC TTCTTCCATG TAGAAACAGG GGAAGAGTTG GAGCAAGCGC CAGCCAAGAT 9060 

GTCTAAATCG CTCAAGAACG TTGTTAACCC AGACGATGTG GTGGAACAAT ACGGTGCCGA 9120 

TACCCTTCGT GTTTATGAAA TGTTTATGGG ACCACTCGAT GCTTCGATTG CTTGGTCAGA 9180 

AGAAGGTTTG GAAGGAAGCC GTAAGTTCCT TGACCGAGTT TACCGTTTGA TTACAAGTAA 9240 

AGAAATCCTT GCGGAAAACA ATGGTGCTCT TGACAAGGTT TACAACGAAA CAGTCAAAGC 9300 

TGTTACTGAG CAAATTGAGT CTCTCAAATT CAACACAGCT ATTGCCCAAC TTATGGTCTT 9360 

TGTCAATGCT GCTAACAAGG AAGATAAGCT TTATGTTGAC TATGCCAAAG GCTTTATTCA 9420 

ATTGATTGCA CCATTTGCAC CTCACTTGGC AGAAGAACTC TGGCAAACAG TCGCAGAAAC 9480 

AGGTGAGTCA ATCTCTTATG TAGCTTGGCC AACTTGGGAC GAAAGCAAAT TGGTTGAAGA 9540 

TGAAATTGAA ATTGTCGTCC AAATCAAAGG AAAAGTTCGT GCCAAACTCA TGGTTGCTAA 9600 

AGATCTATCA CGTGAAGAAT TACAAGAAAT CGCTTTAGCT GATGAAAAAG TCAAAGCAGA 9660 

AATTGACGGT AAGGAAATCG TGAAAGTAAT TGCGGTACCG AATAAACTCG TTAATATCGT 9720 

CGTTAAATAA CGAGTTTATT AGCTCTATCT GCCACCTTCA ATAGTCCACT GGACTATTGA 9780 

AsCCAACTAA ATTAGTTAAC ATTGTTGTGA AATAAGATAG GAGTCCTTCA GAGTAGAATC 9840 
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TGGAGGATTT TTTGAATCTT CTTATGAAAG TATGATATAC TATGGGCAAC TATAAAGTTT 9900 

GAAAAGTGAA ATAAGGAGAA TAAGATGCCA GTAAATGAAT ATGGTCAAAT GATTGGGGAG 9960 

TCAATGGAAG CTTATACTCC AGGTGAATTG CCTTCTTTTG ATTTCTTAGA AGGGCGTTAT 10020 

GCTAGGATAG AGGCTCTTTC AGTGGAAAAG CATGCGGAGG ATTTATTAGC TGTTTATGGC 10080 

CCTGATACGC CTCGGGAGAT GTGGACCTAC CTCTTTCAGG AGTCAGTAGC AGACATGGAG 10140 

GAACTGGTCA GCCTTTTAAA TCAGATGTTG GCTCGTAAGG ACCGTTTTTA TTATGCAATC 10200 

ATAGACAAGG CAACTGGTAA GGCTTTGGGA ACTTTTTCCC TCATGCGAAT TGATCAGAAT 10260 

AACCGAGTAA TAGAAGTGGG AGCTGTCACT TTTTCTCCAG AGCTCAGGGG GACACGGATA 10320 

GGAACAGAAG CCCAGTATCT CTTGGCTTGC TATGTCTTTG AGGAGCTTAA CTATCGTCGC 10380 

TATGAGTGGA AATGCGATGC TCTTAACCTG CCATCCAGAC GAGCAGCGGA ACGTTTGGGA 10440 

TTTATTTATG AAGGAACCTT CCGTCAGGCA CTGGTTTATA AGGGGCGTAC AAGAGATACG 10500 

GATTGGTTGT CTATGATTGA TAAGGACTGG CCTCAAGTCA AAGCTCGATT GGAAATATGG 10560 

TTGCGTCCTG AAAACTTTGA TAAAAATGGA CGACAGCACA AGAGCTTGAG AGAACTTTAA 10620 

GAGGTCTTGA GATGATTACT ATTAAAAAGC AAGAAATTGT CAAGCTAGAG GATGTTTTGC 10680 

ATCTCTATCA GGCTGTCGGT TGGACAAACT ATACCCATCA AACAGAGATG CTGGAGCAGG 10740 

CCTTATCTCA TTCATTAGTA ATTTATCTGG CACTTGATGG TGATGCTGTG GTGGGCTTGA 10800 

TTCGTTTCGT TGGAGATGGT TTTTCATCAG TTTTTGTACA GGATTTGATT GTTTTGCCTA 10860 

GCTATCAGCG TCAAGGGATT GGTAGCTCCT TGATGAAAGA GGCTTTAGGA AATTTTAAAG 10920 

AGGCCTATCA AGTCCAGCTG GCGACAGAAG AGACAGAAAA AAACGTGGGA TTTTATCGTT 10980 

CTATGGGCTT TGAAATCTTA TCCACCTATG ACTGTACAGG AATGATTTGG ATAAACAGAG 11040 

AAAAATAAAA AAACTPGTTT GTTCTTAAGC AAAGTTTAAG GATGGTCTAG TATCATATAG 11100 

TCATTAAATA AAGACCTCCT AACTTTATTT AATAAAATCC TAAACTTTTT TCATCACAAT 11160 

CTCCTAATGA AGCCACCCAA TCAGGTGGCT TTTTTGCGGT ACGACGGGCA TGTCGTATAT 11220

CTGAGGTGTA AGTCCTCAGC CTGACTATCG TGAGGTAGCA GGGAGAGGAA GGGATAGCGA 11280 

AATCGTGGCT CTACGAACAG GAACGTGATA GTAAGGCGTA TATAGCGGAT AAGGAGGCTT 11340 

CAAACTCTAA AGTCCAAAAA GGTAGTCGTA ACCTATATGT GTAAATCACG AGAGTAATTG 11400 

AATTCGGACT AAGGTTTGTG TGAAAAAGAT AAATCTTTCT AGAGTCTAAA GACTCTGCGT 11460 

CAGATTTCCT ATTTTCACTG TAACCTTTTA ACGTCCTCAT ATCTTGTATA AACGAGGAAA 11520 

GATGTACGAC TTATCCCGTG AGGTTTCATG AGCGCTGAAA GCGTAGTAAC AACGAATCAT 11580 

GAGAAGTCAG CCGAGCCCAT AGTAGTGAGG AAACTTCCGT AATGGAAGTG GAGCGAAGGG 11640 
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GTGAATACTC AAACAGTCTG GGGAGAGACT GTTTGAGGTC TGTCGCTAGA AAGAGAAAAC 
GACAGATCGA AGTAATCCTA CTTCACTTGT GTCTGTAAAA TGAGTGGTCT GATAGAACTG 
GACTTTGAGG 

(2) INFORMATION FOR SEQ ID NO: 173:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4185 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



11700 
11760
11770 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

CGCGAAACTA CTTTCTTAGT ATAACACTTT CAGAATCATT GTCAATAGAA ATGACTTGAT 60 

TTTTTCAATT TTTTCAAGCT ATTTCCAAGG GTTGTAAAAT CGTCCCTGAT TCTGCAAGAT 120 

AAGTAGTAAA CTAACTACTA AAAACAAGGT TGCCAAGAGC AAGGTAATAT AGTCTCCTTT 180 

TTTCAAGGCC TGATAACTAT ACCATGTGCG TTTTTTCTCT TTCCCAAAGC GGCGAACTCC 240 

ATGGCAGTCG CAATGGTATC AATGCGTTCT AGCGAGCTAA AAATCAAGGG CGTAATAATG 300 

AGCAGATTGC CTTTGATTCG TTGCATAAGA GAAGCTTTCT TGGATAATTC CATCCCACGC 360 

GCCTCCTGAG ACATCTTGAT AGTAAAGAAT TCTTCCTGCA AATCTGGAAT ATAGCGCAAG 420 

GTCAGGCTGA CAGAATAAGC AATCTTATAG GCCACACCAA TTTGATTTAA ACTGGAAGCA 480 

AACTGACTAG GATGGGTTGT CATCAAAAAG ATAATAGCCA GAGGAATGGT GCAAAGATAC 540 

TTAATGGCCA AATTTAGCAG ATAAAAGAGC TCCTGGCTGG TTAGAGTGTA GACACCGATT 600 

CCCTGCCAAA TCACACTTCT CTCTCCATAA AGTCCAACCC CATACTCGGG AGAAAAGAGA 660 

TAGACCATCA AAACGTTTAA AACGGCAAAT ATCGTCGCAA AAACGGCTAC AAAGGAAACA 720 

TCTTTAAAGC GAATTTCTGA TAAATAGAGG AGAAAGACTG AAAAGATGGC AATCAGCAAG 780 

AGCATTCTGG TATCATAGCT AATCATGGCC GCCAATGATA CCAGAATGAA AAAGAGAAGT 840 

TTCCCAGCTC CTGACAAGCG ATGAATCACA GTATCTCTAT GCTGGTAACC GATTAATTTA 900 

GCTTGCATCC CTCTCTCCTT TCTTTGTAAA ATGCCGTTAA ATCCAGTGGA TCCACATCTA 960 

GTTTCTTAGC CAAGTTAAAG ATGGAGGTTT CTTTTAGATT GGCTTTTACT AACAGCTCAG 1020 

GATCGCTCAA CAGACTGGCT GGAACAGTAT CGGCAATCAA TTCTCCATCC ACCATGACAA 1080 

GGACCCGGTC TGAATAATCC AGCATCAATT GCATATCATG GGTAATCATG ACAATGGTAT 1140 

ATGTAACTCT TCGAGAAATT CCATAATCTC AGTATAGTTC TTCTGATCTT 1200 
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GACCTGCAGT CGGTTCATCT AGGAGAATAA TTTCAGCTCC TAAGACCAAA ATTGAAGCAA 1260 

TGCTGACACG TTTTTTCTGA CCAAATGACA GGGCAGAAAT AGGCCAATTA CGGAATTCAT 1320 

AAAGTCCACA GATTTTCAAG GTTTCATATA CTCTCGTTTC AATTTCCTTC TCATCCACAC 1380 

CTCGCAAACG GAGCCCTAGA GCCACCTCAT CAAAAATCAT ATTGGTTGAA ATCATTTGAT 1440 

TAGGATTTTG TAGCACATAT CCTACTCGTT CCGCCCGCTC TGCAACAGAA TCGCCTTTTA 1500 

TATCCTGTTT TTCCCAAAGA TAGCGTCCTT CCGTCTGAAT AAAGCTACTT ATAGCCTTGG 1560 

CTAGAGTTGA TTTCCCTGCT CCATTTTTTC CGACAATAGC AATCTTTTCA CCCTTTTTAA 1620 

TATCTAAATG TAGGGATTTT AAAATCGGTC TATCATCATA AGAAAAAGAT ACTTCCTCTA 1680 

GTCTAAAGAG TGACTGCAAT GCTGGGGTTT CTTTTGCCAG TTCATTCTGC AACTGAACCT 1740 

GACCTTTTGA GATAGACAAG TTATCCAGAT TCGCTAATTG TTCTTCCTTG ACTAAGTCCA 1800 

CACCTAATTG ACGGAGAGTC GTTAGATAAA GGGGTTCTCG AATTCCATTT TGAGTCAATA 1860 

AATCAGTCGC AAGCAACTGG TCAGGGCTCC CATTAAAAAG GATACGACCA TCGTTTATCA 1920 

AGACAATCCG ATCCACAGGG CGATGCAGAA CGTCCTCCAA ACGGTGCTCG ATAATAAGAG 1980 

TCGTCGTCCC CTCTTCCTTA TGAATCTGGT CAATCAATTC GATAATATCC TGACCTGACT 2040 

TGGGATCTAG ATTGGCGACT GGCTCATCAA ACAAGAGAAT CGGACTTTCA TCAATCAAGA 2100 

CACCAGCCAG ACTGACTCGC TGCTTTTGTC CACCTGACAA ATCCTGAGGA CGCTGATCCA 2160 

GTAAAGGAAG AAGGTCCAGC TTTTCAGCCC ATTTATAAAC ACGACCTTTC ATCTCATCTA 2220 

GGGCTGTCAC ATCATTTTCC AGAGCAAACG CCAAATCTTC TGCCACAGAC AAGCCAATAA 2280 

ACTGCCCATC TGTATCCTGC AAAACTGTGC TAACCAGATG AGACTTATCA TAGATGCTCA 2340 

TATCAAAGGC TACTTGACCC TTTATCAAAA ATTCTCCATA TGTCTGACCC TTGTAAATAT 2400 

TGGGAATAAT CCCATTCAAA CACTGACCCA AGGTAGATTT ACCTGACCCA GATGGTCCAA 2460 

CAATTAAGAC TTTCTCTCCC TTGTAAATGG TCAAGTCTAT CCCTTGCAAG GTCGGTTCTT 2520 

GTTGTGTTTC ATACCGGAAA GAGAAATCCT TCCACTCAAT TaTAGCTTCT TTCATCT1 AC 2580

TCTCTTCATT CGCTTCTTAG ACTTCTATTT TATCATAAAT CAAGCCCTTC TTGCAGTCTC 2640 

TCCTCTTAAA ATCTTAGCGC CAAAAAGATT CCTATCCTAG CTTACTTGCC TAACTAATCT 2700 

ATAAACATCG AAAAAGACTA GTTGCCCAGC CTTCCCCATC ATTTTATACT CTTCGAAAAT 2760 

CTCTTCAAAC CACGTCAGcT TCGCCTTGCC GTAGGTATGG TTACTGACTt CGTCAGTTTC 2820 

ATCTACAACC TCAAAACCAT GTTTTGAGCc TGCTTCGTCA GTTCTATCCA CAATCTCAAA 2880 

ACACTGTTTT GAGCAACtGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT 2940 

TAGTCCTTTT TCAAACTTCC TGCACGAGTT TGGGTTCCTG CATAGGCAAG TAAGAGAAGA 3000
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GTTCCTGCAA TAGCTACAGA TACACCATTG GCAATTCCCG CAACAATCCC TTGTGCAAAT 3060 

ACTTTTTCTG CCGCTTCTTG ATAAATCACA ACATCTCCAA GTGGTGCCAA GACACCCCAA 3120 

ACAAGGGCAT TTGCAAGTAG TTGAATGAGA TTAAAAATAA GAATATCTTT CCAGTCAAAA 3180 

ACACCATTGA TCACGCGAAC GTACTTTCTA AAAAGTCCCA CAACTAAACC AAAGAGTCCG 3240 

CTAGCGATAA TCCAAGTCCA CCATAGACCA TAACCAACAA GAGAGTCCTT GATTGCATGA 3300 

CCAATCAACC CGACAAGCAA ACCGATAATC GGTCCAAAAA TAATAGAAAG TAGCGCTTGT 3360 

ACCGCATACT GAAGCTGGAT GCTTGTATTT GGAACAGGGG TTGGAATGTT GATCATCCCG 3420 

ATGACGACAA AGAGGGCAGC GCCAATTCCG ACAGCAACAA CTTGTTTAAT TGTAAATTTG 3480 

ATTTCCATAC TATTCTCCTA TTTTATCCTT CTATTTTCTT TATTTCAATG GTCCAAGATG 3540 

AACCGACACC TACATTATAG GCCTTGGCAA AGGAACCTTG GTTGATAGCC AAACCTAAAC 3600 

GATAGAGAGA GTTGATGTAA AGGATGGGTT GCCCAATTCT CACATCTGCA AATGATTTGC 3660 

CATAGACAAC CTGATTTTGA TAGACCAGCA TATCAGCATG ATAGATGGTC ACTTCAAAAC 3720 

GATCACCAAA TTCTGGTTCC AGCTTGTAAA ATTCTTCCCG TGTGATAGAG GTCCAAAGCG 3780 

AACCGAAACG CACATCCAGA ATATCAATGG CTCCCTTCAC CAGATGATCT TCTATGATGG 3840 

TCGCTACGAC TGGAAGCTCT ACAATCTGTT CCACACTGAG CTCTGGCCCT ACTTCCTCAA 3900 

AAGTAATGTG ACCACTGGCC AGTTTAGCAC CAGTATAGGC ATAGACATCA CGACCGTGGA 3960

AGGTATAAGA ATGCTCTGTG TTTTGACGCC TATTGGCCAC CTCAGAAATC TCACGAATGG 4020 

CTACAATGCC AACGTGTTTC TTGATAAAGG AAAGCGTCCC ATTATCTGGC GTGACAATGT 4080 

ATTGATTTTT TGCAGTCTTG GCAACTACAC TCTTACGTTT CGAACCGACA CCTGGATCGA 4140 

CAACCGATAC AAACGTCGTT CCCTCAGGCC AGTAATCCAC CGTCT 4185 
(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

TGATAGAGTT AAAGCCGCTG AGTCATTCAA TCCATCTCCA ACCATCAAAA TAGTGTGACC 60 

TGCTTTCTGC AGTTTCTCTA CTAACTCAAA TTTCCCATCA GGTTTCAAGT CTGTATAGAC 120 

CTGATCAAAG GGCAAATCTT TGACTAATTC CTCTGTCCTA ATCAAGGTGT CTCCTGTTGC 180 
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CAGAATCAAT TTTTyCCCCT GTGCCTTAAG TTTATCCAAG GCTGTTTTTG CTTCTTTTCT 

CAAAGGAGTA TGAATGCAGA ACATTCCAAT CAATTCATTT TGATAAGCCA AGAATAAGAG 

ATTGTAGTGA CTCTTGTACT CTTCAATTAA AGCATTTTGT TCTGAACTGA TATGAATCTG 

CTCATCCTGC ATCAAGACAT AATTCCCAAT AAGAACTGGT TGGCCATCTA TATGAGATTT 

GATCCCCTTG CTTGCGATAT ATTGGAGTTT CCCATGCATT TCCTCATGTT CAATTCCCTC 

TATCTCAGCT TGCTTGACGA TGGCATTAGC AATAGGATGA TAAATGTGTT CCTCAAGACA 

GGCACTGATT CTGAGAATAT CTTCCTCACT ATAGTCTCCA AAAGGTAACA CCTTTTCAAC 

TATAGGATAA CTAGTTGTGA TTGTTCCTGT CTTATCAAAC AAGAAAGTAT CAACTTCCAG 

ATATTTCTCC AGAACATCTC CATCCTTAAT CACCATTTCA CGGTTCAACC CTTCCTTGAT 

AACTGTCAAA TAAGCTACAG GAGTAGAGAT TTTCAAAGCG CAGGAGAAAT CGACCAATAG 

GAAAGAAATA GCCTTAGAAA AAGAACCTGT CAATAGGTAA GTCAGCCCAG CCCCCAAGAA 

ATTATATTTG ACGACTTTAT CCGCCATCTT GATGAAATAG CGTTGTTTCG TTTTCTTGTT 

TTCTTCAGAT TTCTTCATCA ACTCAATCAG CTGTAAAATA CGGCTCTTCA TCTGATTATC 

TGTTACACGA ATGCGTAACT CTCCAGTTTC TAATACTCTA TTTGCACAAA CCAAATCAGA 

CTCTCTTTTT TCAACTGGAA AACTCTCTCC TGTCAAGGAA CTTTCGTTGA CCATACCTAA 

ACCTGAAACT ACTTGTCCAT CAAACAGAAT TTCATTTCCT TGAGATAAGA TCAAGACATC 

TCCTATTTGA ACATCGGAAC TCTTGATACT AACAACCGTA TCGCCCTGTA CTAGGAATAC 

ATCGCTCTCT TTTGCAAGAA GACTCTGTTC TAAATCTGTT GCAGTTTTTT TCAAGGACCA 

CTGATCTAAA TGATTCCCCA AATCAAGCAT AAACATGATA TTGCTAGCTG TCTTGGATTG 

GTTCATAAAC AAAGACAATA AAATAGCCGA ACAGTCCAAG ACTTCCATCG TTAGTyCCTT 

ACGCGCTAGT GTTTGATAGG CTTCTCTAAT ATAACCCAAA GCCTGATAAC AAGTCCATAT 

ATAGCGAATA GGATACGGCA CAAAACTACG AAAAAGTACA CGCTTAACCG CTGCACCTGA 

AACAATAGAA TAAGCACTCT CTTCTCTACG AATGGGAAGA GTCATCAACT CAGAAACTTT

CCCTTTATCA ATTCTTTTTA AAAAGGCTTC TGCATTATCT AATACAGAAA AGCCTTCTTT 

TATGCGTAGA GTAAAGTGCT GTTGATCCAT GTAAAACTGG ATAGACTCAA TCCCCTTTTC 

ATCTCTCGCC AAGGAACGAA GATAGTCTTG AATATCCAAG GTAAGTGAAA AAGAAGATGA 

TAGTCGGATA TGTTGGTATC CTCTATGTAG CACTTTAAAA GACATATTAT TCACCTATAA 

GGCTATCTAA TTGCTCTTCT TTTTTCTCTT GCTCGTACAA ATATTTGGCA TCTTGCAAGA 

CATCGTCTCC ATGTTGCTTC ACAACAGAAA CAGATGCATC TAGCTCGTCT TTCAACTTGT 

AAGCCTTAGC CAAAGCTTTA GAATAACCTT TTTTAGCTTC CTTACTTGCT AAGATTTTCA 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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AACCAAGGGT ACCAAATGCG ACACCACCCA AAAATAATGA AGATTTTTTC GCAACTTTTG 2040 
CAACGGTTAA TACTTCTTTT AACATAGGG 
(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4597 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:
AAATCTTGCG CAATAAAGCT CATCTCCATC TCCCGATTGA AACAGTCACT 
TTTCAACGTC CCAACACATA ATCTTACGCA GATTTCTAAA ATTACACTCA 
CATTGAGCTT TCGAATGACA GTTGAAGTTG AAATGGCCAC CTGATGGGCA 
TAGAAATCTT TTCAATTAAC TTTTGCGCAA TCTTTTGGTT GATAATACGA 
GATTTTTCTT GACGATAGAA GTTTCAGCGA CCATCATTTT CAAGCAATGA 
AACGACGTTT TCTAAGGAGA ATTCTAGTAG GCATACCAGT CGTTTCAAGG 
TATAGGGTCT TTAATGTCTA GTAATTTTGT GATAAAATGT AATTGTTCCA 
TCTAATGAGT TGTTTTGTCG CTTTTCATTA TAGATCTTAT GGGACTTTTT 
AATAGGCTCC ATAATATCCA TAGGGAATTT ACCCACTACA AATATTATAG 
TTTAGGTCGC TTGATAATAT GCGTTTTTTG AATTTTATAG ACTGCTCGTT 
TTACTTCGTA CCTTCTGGAG CGAGACCCAA TATTAGTCAC ATACAAAATG 
GGATTTTATT TTCATGTACA ATTTCAGCCA GTCTTGTTAT AATCAGCCTA 
GGAGGTGACT CTTATGGCTG TTTTTGTGTC TTTGGATGGA ATTGTGGTAG 
TGTCTTTTCT TCTTTTAATG GGGATAGTGA GTTTTTCTTG TGTATAGCAT 
GAATAGGACG CCATGACTGC TAAAAGATTT CTATAAATTA ATTTGATTTT 
TTGTTCATAT CTTATTTCAT TCCACTATAA ACGTCTTAAA GACAAGAGTC 
GGAACGCTCT CAGTTCGAGG AGATGTTCCA ACTTCAAAGT AGTCGCTTGA 
AAAATTACAA TTGTTTACCT CTGTGTTTGC TGGCCGTTAT GATGTTTATG 
TATCAATGAA CAAGGGAAAA TTCAGTATTT TCCTTCCTAT GATTATGGTT 
GCCACCTGAA AAACGGAGTT TCCAGACATT GACGAACTCC GTTTTGAAAT 
TGGGGAGGCA GCTATCGGTA TCTTTCCTAT GCACTTAGAT GATAGCTGTT 



CCCCGGACTG 60 

AAGTGGAAGT 120 

ATATCGGTCA 180 

GGAATTTGCT 240 

TAGCACTTAA 300 

TAAGGAATTT 360 

TATGATTCTT 420 

TTCTACCCAA 480

AGCCCAAAGT 540 

TAAACTCTAT 600 

AGTACTATTA 660 

TAGGAATCAA 720 

AAGTCCTTGA 780 

TTTGAATCTG 840 

CCTAATCAAT 900 

AGTTTGTTAT 960 

CGACGCAAGA 1020 

CTAAGAATTT 1080 

GGAAGCAGTT 1140 

CTCATTTTCG 1200 

ATTTTTTGGT 1260 



WO 98/18931 



PCT/US97/19588



1106 



ACTGGATTTG 


GATGAAGGAG 


ATTGGAAAGA 


AGCTGGTTTA 


ACCATTCGAA 


GAATAGCCAG 


1320 


GGAACGCCAG 


ATGGAAGCCC 


ATTTAGAGAT 


TTCTCGTTCG 


GGTCACGGAC 


TCCATATTTG 


1380 


GTTCTTCTTT 


GAGGAAGCGA 


TTCCGAGTCG 


AGAGGCTCGC 


TTGTTTGGAA 


AGAAACTGAT 


1440 


AGAACTGGCA 


ATGCAGGAAA 


GTATGCAACT 


GTCCTTTGAT 


TCTTTTGATC 


GCATGTTTCC 


1500 


AAATCAGCAT 


GTCCTTCCTA 


AGGGGGGATT 


TGGAAATTTG 


ATTGCCTTGC 


CTTTTCAAGG 


1560 


AGAAGCTTAC 


CATCAAGGGC 


GAACGGTCTT 


TGTGCATGAA 


CAGTTTCAGC 


CTTATGAAGA 


1620 


CCAATGGAGG 


TATCTACAAG 


AAATTCAGAG 


GATTTCAACT 


GCTAAAGTGG 


CACTGTTAAT 


1680 


CCAAGAG6AG 


TTAGGCAAGC 


AAGAATTGGA 


TAAGGAGTTG 


AAGGTCCTTT 


TATCCAATAT 


1740 


GATCCAACTT 


GAAAAATCGT 


CTGTGACATC 


CAAGGCACTT 


TTTTCTTGAA 


AAATATGGCT 


1800 


TCCTTTTCTA 


ATCCCGAATT 


TTATAGTAGA 


TTGAAACTAG 


AATAGTACAC 


CTCTGCTTCT 


1860 


AAAACATTGT 


TAGAAATCGA TTTGACTTTC 


CTGATCGATT 


TGTCCTGTTA 


TTATTTCATT 


1920 


TTACTATATT 


TAAAGCAGGC 


TATGCGACAG 


CCAACCTATC 


AAATTCCTGA 


GAGAATGTAT 


1980


TTATTTGGAG 


AATCCGATCA TTATTTATGG 


TTGCCAAGAG 


GTTTGCTGTA 


TCCATTGCAA 


2040 


GATAAATTTA 


AGCAGGTATC 


TGTGGAAGAT 


AGGAGAAAGG 


TACAAAGGTC 


TATTAGCGTG 


2100 


GAATTTAAGG 


GAGAACTCAC 


TTTTGAGCAA 


GAGTTAGCCC 


TGTCAGATAT 


GACTTCTAAA 


2160 


GAAAATGGTT 


TACTTCATGC 


GGAGACTGGT 


TTTGGGAAGA 


CCGTTTTAGG 


TGCTGCTCTT 


2220 


ATCTCTGAAC 


GGAAAACAAA 


AACAATTATT 


CTAGTCCATA 


ATAAGCAACT 


CTTAGACCAA 


2280 


TGGCTAGATC 


GCTTAAACTG 


CTTTTTGACT 


TTCGAAGAGG 


AGGAGGCTAT 


CCCTTATACG 


2340 


GCATCAGGTC 


GTGAAAAGGT 


AATCGGCTAT 


GTTGGGCAGT 


ACGGTGGGAC 


TAAGAAATGG 


2400 


CTGAGTAAAC 


TGGTTCATGT 


CGTTATGATT 


CAATCTCTAT 


TTAAGTTGGA 


AAATAGTCAA 


2460 


AGTCTTTTGG 


ATGAGTATGA 


GATGATGATT 


GTGGATGAGT 


GTCATCATGT 


CTCTGCCTTG 


2520 


ATCTTTGAAA 


AAGTTGTTGC 


TCAGTTTAGA 


GGGAAGTATC 


TTTACGGTTT 


GACGGCTACG 


2580 




AGAATGGTCA 


TGAGCCTATT 


GTTTTTCAGA 


GAATTGGTGA 


GATACTCO.T 


2640 


ACTGCTGATA 


AGAGGGAAAC 


GGATTTTAAA 


CGGCAATTGC 


AATTAAGATT 


CACTTCTTTT 


2700 


GGTCATTTGG 


AAATTGAAAA 


GACCAAAGCA 


AGTAATTTTA 


TACAGCTTAC 


TGATTGGATT 


2760 


GCTACTGACT 


CAGTGAGGAA 


TCAGATGATT 


CTCAAGGATA 


TTCTAGCCCA 


AGTGGCAGAA 


2820 


GGACGGAATA 


TCTTGGTTTT 


AGTTAATCGA ATTCAACAGA TAGATGTCTT 


TGAAAAATTA 


2880 


TTGAAAGAGA 


AAGAGGTTGA 


TGACTGTTAC 


ATTATTAGCG 


GAAAAACCAA 


AGTCCGAGAG 


2940 


AGAACGAGTT 


TACTGGAGAC 


GTTAGAACAG 


TTAGATAAAG 


GGTTTGTTTT 


GTTGTCTACT 


3000 


GGAAAATACA 


TTGGCGAAGG 


TTTTGACTTA 


CCTCAGTTGG 


ACACGCTTAT 


CTTGGCAGCA 


3060 
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CCCTTTTCTT GGAAAAATAA TTTGATTCAG TATGCAGGTC GGATTCATAG AAACTACAAG 3120 

GATAAGTCTT TGGTGCGTAT TTTCGATTAT GTGGATATTC ATGTTCCTTA TTTAGAAAAG 3180 

ATGTTTCAGA AACGACAAGT AGCTTATCGA AAGATGGATT ATCGTGTCAT CGAGGGTGAG 3240 

GAGAAACAAT TCGTTTATCT TGATAGTAGA TATGAGAAGG TGTTGAGAGA GGACTTAGCA 3300 

GGGGAAAGAC AGGAATGTCT GCTTATTTTA CCTTATGTGC ACCAGACAAA ACTGATGAAT 3360 

TTTCTAAAAG AATTTAGGAT TAGTCAAATT GAGATATGTA TACCAGAGAC GGTTGCAAAT 3420 

AAAGCATGGC TAGACCAGTT GAAGAGCCAG AAAATTAAAG TGTCTTTTAC TCAATCAAAA 3480 

ATAGTAACGC CTATTCTTTT GGTGAATAAG ACTATTGTTT GGTATGGTGC AATGCCATTA 3540 

TTAGGGAAGG TAGATGAGAT GACCATATTA CGTTTGGAAT CAGCTAGTAT AGTTTCTGAA 3600 

CTAGTGGCAG GTTTACGATA GAGAAAATTT TTAAAAATTT CTATGTATGA TTTTCATTTC 3660 

TTTAGTGAGA CTGTTGCCAT TATCACATTC GAATCACACA AAATAAAAAA ATTTTTATAA 3720 

GTACTTGACA AATAGATTGA AATATCATAA AATAAAAACG GTTACAGAGT TATTAATTAT 3780 

TTAAGCTTCA TGTCACCATT AAAAATTGAA ATAAAAGGAT GTTATCACTA ATACAAGTGA 3840 

GCAGGAACCT ATTTAATCAC ATCAGAAGAA GTTTCTTGAT GTTTTTAAGT AGGTTCCTTT 3900 

TATTTTAAAA GGGAAATTTT ATGATCATAA AACGAATACT AAACCACAAT GCCGTAATTG 3960 

CGCAAAGTAA AAAAGATATC GATATTCTTC TTTTTGGAAG GGGAATAGCT TTTGGAAGAA 4020 

AAACTGGAGA TAAAGTAAAT CCAATTGATA TTGAGAAAAG TTTTTTTCTC AAAAATAGAG 4080 

ATAATATGAC CCGTTTTACA GAGATGTTTA TTAACGTTCC TTTGGAGTTG GTGTACATCA 4140 

CCGAAAAAAT AATTAACCTA GGTAAAATAA CATTGGGTAA TAATTTTGAT GAAATTATCT 4200 

ATATTAATTT AACGGATCAT ATTTCTTCGA GCATAGAACG TTATAAAGAA GGGATTATTA 4260 

TTTCGAATCC CCTACGCTGG GAAATATCGA AATATTATAA AGAAGAATTT GAACTTGGGA 4320

AAAGGGCTTT ACAAATAATA AAAAAAGAGT TAGGTATTGA ACTTCCAATT GACGAAGCTG 4380

CATTCATAGC GCTACATTTT GTTAATGCTA ATTTAGAAAA TAATTTTCAA GAGTCGTATA 4440 

AAATCACTGA AATAATTATG GGAATTGAGA AAATCATTCA AGATTTCTAT TGTACTGAGT 4500 

TTAACCAAGA TTCTATTGAT TATTATAGAT TCATAACTCA TATGAAATTA TTTGCCCATC 4560 

GCTTGGTTGA GAATACAACT TATTGTGACG ATGATGA 4597 
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



CGGCTTATTT 


ACTACTTGTT 


CCATCATATA 


TGGAATATGC 


ATGAAACCTG 


CTCTCATATT 


60 


AGGGAATTTT 


TTATCCACTA 


AATAAAGAGC 


TTGGTACATC 


AAATGATTGC 


AAACAAAGGT 


120 


TCCTGCACTA 


TTGGATACAA 


CTGCCGGAAC 


TCCCTGTTTT 


TTGATAGCTT 


GTACCATCGC 


180 


TTTGATAGGT 


AAACTACTAA 


AATAGGCCGA 


TGCTCCATCA 


ATACGAATCG 


GTGTATCAAT 


240 


TGGTTGATTG 


CCTTCGTTAT 


CAGGTATGCG 


AGCATCATCT 


TGATTAATAG 


CCACTCGTTC 


300 


AGGTGTTAAG 


CCGGTCCTGC 


CGCCTGCTTG 


TCCAATACAA 


AGTACAGCAT 


CTGGTTGATA 


360 


TPGTAATATT 




AAACTTCTGA 


CGACTTATAA 


AAAACCGTTG 


GAATTTCTAC 


420 


CCAGCGAACT 


TCAGCCCCAT 


TAATCTCAGA 


TGGTAATAAT 


TTTACAGCCT 


CCAAAGC7GG 


480 


ATTAATPTTT 


TCACCTCCAA 


AAGGATTAAA 


ACCTGTAACC 


AATATTTTCA


TTTTATTTTC


540 


f**f*T w PAf*l*AAA 


ATGCGAGAAA 


GTACATTAAG 


AATATGTGAA 


TAACAATCAT 


TACTAGAGCA 


600 


Af* Af*TTR C*TT 


GAGCCTTTAT 


AACGCCATTC 


TGATCTTTCA 


TATCCATCAA 


TGCTGCTGGT 


660 


AGAGPGTTAA 


AATTAGCAGC 


CATTGGGGTC 


AATAAGGTCC 


CACAATAACC 


TGCTGTCATG 


720 


GCAAGAGCAC 


CAGCCACAAT 


TGGATTAGCT 


CCCAGAGCAA 


ATACAAAGGG 


AACTCCAACA 


780 


rTTrcCTGTAA 


TAACGGTGAA 


TGCTGCAAAA 


GCATTTCCCA 


TAATCATTGT 


GAATAGAACC 


840 


ATTCCAAGAA 


CATAGGCCAA 


AACTCCTATA 


AAGCGACTAT 


CTGAAGGAAC 


AATACCGCTA 


900 


ATCAGATGAG 


AGATAACATC 


ACCAACACCT 


GCTACAGTAA 


AAATAGCCCC 


CAAAGCCCCT 


960 


AATAATTGAG 


GAACAATCCC 


ACTTGTTGAA 


ACTTGCTGAG 


TCATTCGATT 


ATTTTCTGAT


1020


AACAGACTCT 


TAGGGTGACT 


ATTGGTAATC 


ACAAGAACAG 


AAATTGTAGC 


AAACAAGGCG 


1080 


GCAAGGCTAA 


TCGAAATCTT 


GCTAAATTCT 


GGAATCATTT 


GCGCTAAGAC 


CAACGCAAGT 


1140 


ATTGCCATCA 


GCATAACTGG 


AATAAAAATT 


TTATTTTTCA 


ACCTGTTAGA 


TTCAATATTG 


1200 


GCTTTCATTT 


CATCTAAGGA 


TGGCAAGGTT 


CCGATACGGA CTTGCTTAAA 


CAATGTTAAC 


1260 


AGCGATAATA 


GGATTACAAT 


AATACCAATA 


CTCATATTTG 


GCATATAGGA 


ACCACCTATA 


1320 


AACGTAATAG 


ACAATAGAGT 


CCAAAATGCA GATGTCCCAA GTCGAACTGG GTlWHTTA 


1380 


TCTTTATAAC 


TACAATAGGC 


TGTATGGAGA 


AATTGACAAC 


CAATCACAAT 


ATAGGTCAAC 


1440 


TCTAATAGTT 


GCTTTGCCAA 


CTCTGTCATT 


TTTGTTCTCC 


TCCCCTAGTC 


TTTTTTGATA 


1500 


TCAATTTTTT ATCAAATAAA TAATTATAAA TCCCCACTAC 


AATAAGTGTT 


ATAACAGCAA 


1560 


CAATAATAGA 


TGTAGAAGCA 


ATCCCTGCAT 


AATTGCTTTC 


ATAGCCTAAC 


TGATCTAATG 


1620 



TTCCCCCTAT 


CAAGAGGACT 


CCCCCAGCAC 


CTACAAACGT 


ATTTTGAGCA 


AAGAAATTTC 


1680 


CAAAATTTTC 


ATTCGCAGCC 


GCACGCGCTT 


TTATTGTCTC 


ATCTTCAACC 


TCTUTTAACT 


1740 


TTCTACCTAA 


TTGAGACTCT 


GCAGCTGCTT 


CTCCCATAGG 


TTGAACCAAA 


GGTCTGACAA 


1800 


ACTGAGGGTG 


TCCTCCTAGA 


CGAATTGAAA 


AGAAACCAGC 


TAACTCTCGA 


ATAAAGAAAT 


1860 


AAACTGTATA 


GAAGTTTCCA 


ACTGTCAGAC 


CTTTAATCTT 


TCGAATCAAA 


TCGATTGATC 


1920 


GTTGCTTGAG 


TCCAAACGTT 


TCTGACAGCC 


CCACAAGAGG 


CAAGGTAACC 


ATAAAAATCG 


1980 


TGAGCACTCG 


CTGATTGCTA 


AATTCTTTTC 


CCAAAATCTC 


CAAAAATTCA 


ACGAGAGAAA 


2040 


CACCTGAAAC 


TAAAGCTGTA 


ACCAAACCAG 


CTAAGACTAC 


TGTTGCAATT 


GTATCAAATT 


2100 


TTAAAATAAA 


ACCCACAACA ATGATTGCTA 


TTCCTATTAA 


TCTAATCCAC 


TCCATATCAA 


2160 


ACTCCTTTAT 


ATTCAAAATG 


ACAGTATTTT 


TAAAATTTTA 


TCAAGATCAA 


TACCATTCCT 


2220 


TATTTAATGT 


GTTTTTCTAG 


TTCTTTTTGG 


TATTTGCTAT 


TGGATTCCAA 


TTTTTCTTTT 


2280 


TGCCATTTTT 


TAAAAACCTC 


GTTATATTCT 


TTTGTTGTAA 


CAATATCTTT 


TTGCAATTTC 


2340 


ATTCCTTTAA 


AGATATATGG 


ATCCCCCTTA 


ATACCAACTT 


GTGAGTATGG 


TTTTGAGAAT 


2400 


GGTACTACGT 


TACTTACAAC 


TGGAGAACCA 


CCAGATGAAG 


CTGTTGGCAT 


CAATAATGAA 


2460 


CTATCTGTCG 


ACCAAGCTTG 


AGCTTTGGCA 


TATTTTTCAT 


ATCTTTTCTC 


TAGGTCAGTG 


2520 


GTCTCAGAAA 


CAGCATCTTC 


TAACAATTTC 


TTATATTTAT 


CCAAACCAGG 


TTTAGCTACA 


2580 


ACATCCTTAT 


CTTTTCCTTT 


CGTAATACCA 


AGGTGTTTCA 


TGGCAGAACC 


AGATTTTGGA 


2640 


TCTATAATAT 


TCAAGTGAGA 


CGCTGGATCT 


TGATAGCTTG 


GAGCCCATCC 


TGTACTGTTC 


2700 


AAATCATAGT 


CTTTTTGAGA AGGAGCAACA TTGCCGTATT TATCATTTTC CATCAAACCA 


2760 


TCAATAACAT 


TTCCAATAAC 


GTCTGTCCTC 


GATGTTCGAG 


TCGCTATACT 


GTAGCCCAAT 


2820 


GATGCTGGAT 


CTACTGCATA GACATAAGAA 


AATGTTGTCG 


GTGCATCTGC 


TTCTTTATCA 


2880 


GTTTTTCCAC 


AAGCCACTAA AATAGCTGAC 


GTGCTCAGGA 


CCACTCCTGC 


TGTTAAGAGC 


2940 


CACTTTTTCT 


ATTTCATAAA 


GAATCTCCTT 


TGGTTTATTT 


TAATCTACTT 


TTACAATCCA 


3000 


ACCTTCTGGC 


GCTTCAATAT 


CGCCAAACTG 


AATACCCCTC 


AATTCATTAT 


ATAATTTACG 


3060 


CGTCACAGGA 


CCTACTTCTG 


TTTCACTATA 


GAATACATGG 


AAATCATCAC 


CATGTTGAAT 


3120 


ACCTCCAATT GGAGAAATAA CCGCTGCTGT ACCACAGGCA CCTGCCTCTA CAAAACGGTC 


3180 


AAGATTATCA 


ATTGGAACAT 


CACCCTCAAT 


AGGAGTTAAT 


CCCAAGCGAT 


GTTCTGCCAA 


3240 


ATAAAGCAAG 


GAATACTTGG 


TAATAGATGG 


CAAGATAGAT 


GGACTCAATG 


GTGTTACAAA 


3300 


TTCATTATCA 


GCTGTAATTC 


CAAAGAAGTT 


AGCTGATCCG ACTTCTTCAA 


TCTTTGTATG 


3360 
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AGTTGATGGG TCCAGATAGA TAACATCTGA GAAATGACGT GACTTGGCCA TTTTTCCTGG 3420 

TAAGAGACTT GCAGCATAGT TTCCACCAAC CTTAGCCGCA CCTGTACCAT TTGGTGCTGC 3480

ACGGTCGTAC TCATCCTGAA TCAAGAAGTT GGTTGGGACC AAACCACCTT TAAAGTAATT 3540 

TCCAACTGGC ATAGCAAAGA TGGTGAAAAT GTACTCTTCT GCCGGTTTTA CCCCGATAAT 3600 

ATCTCCGACA CCAATCAAAA GAGGGCGAAG ATATAAGGTT CCACCTGTTC CGTATGGTGG 3660 

TACGTATTCT TCATTCGCAC GGACAACTGC TTTACAAGCT TCTACAAACA TGTCTGTCGG 3720 

AACTTGTGGC ATCAAGAGAC GGTCACATGT ACGTTGCAGA CGTTTAGCAT TTTCATCAGG 3780 

ACGGAACAGT TGAACACTGC CATCCTTAGT ACGATAAGCT TTCAAACCTT CAAATGCTTG 3840

TTGTCCATAG TGAAGACTTG GAGAAGACTC TGAAATATGC AAAGTTGCAT CCTCTGTAAC 3900 

CTCTCCTTGA TCCCATTGTC CATTTTTGAA ATGAGCAAGA TAGCGATAAG GTAATTTCAT 3960 

ATAGGAAAAA CCGAGGTTTT CCGG 3984 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TATCTAATTA TTGGTTTTTA TCCCTGACCT TGGCTATTGT TGGGGTTGTT TTACCCTTGT 60 

TGCCTACAAC ACCTTTCCTT TTGTTGTCTA TTGCTTGTTT CTCCAGAAGT TCCAAGCGAT 120

TCGAAGATTG GCTTTATCAT ACCAAGCTCT ATCAAGCATA TGTAGCTGAT TTTCGTGAGA 180 

CCAAGTCTAT TGCGCGTGAA CGAAAGAAAA AAATCATCGT CTCTATCTAC GTCTTGATGG 240 

GAATTTCTAT TTATTTTGCA CCTCTTTTAC CAGTCAAAAT CGGTCTGGGT GCTTTGACCA 300 

TCTTTATTAC TTATTATCTC TTCAAGGTCA TTCCAGACAA AGAATAGTTA AAACAGTACT 360

TATTTGCCTT GATAAAATTG AAAGCATATT CATAACAATA TGATATAATA AAATTGAAGT 420 

AATATTCAAG GAGAATCAAA TGATTTACGA ATTTTGTGCT GAAAATGTGA CTTTACTTGA 480 

AAAAGCGATG CAGGCTGGAG CTCGTCGGAT TGAACTCTGT GATAATCTAG CAGTTGGTGG 540 

GACAACACCC AGCTATGGAG TGACTAAGGC AGCGGTTGAA CTGGCAGCTA ACTACGATAC 600 

AACCATCATG ACCATGATTC GGCCACGTGG TGGTGACTTT GTCTATAATG ACCTAGAAAT 660 

TGCTATCATG CTAGAAGACA TTCGTTTGAC TGCTCAGGCT GGAAGTCAAG GGGTTGTATT 720 

TGGAGCTTTA ACTGCTGATA AAAAGTTGGA TAAGCCTAAT CTGGAAAAGT TAATTGCTGC 780 



ATCAAAAGGA 


ATGGAAATTG TCTTtCACAT 


GGCC 1 1 1 vjAi 


OnAL I AAL* 1 V* 




840


AGCGGAAGCT 


ATTGACTGGC TCAGTCAAGC 


CGGTGTCACT 


LI/1 A l Ll. I AA 






TGTGTCTGGC 


GACTCCTTAG AAAAACGTTT 


TGTTCACTAT 




luunu 1 AV-aj^. 


960 


TAAAGGTAAA 


ATTGAAATTC TACCAGGTGG 


GGGGATTGAC 




VvXs^AAACUl J 


1020


TATCGACCAG 


GTGGGGGTAA CACAATTGCA 


TGGTACTAAG 


Qri i\» n n it 


AAAAAAIAUA 


1080


AAGGAACTGC 


TAGCTTTGGG TAGCAGTTTT 


CACTTATGTT 


TGAAATTTTT 


AAATCCTATC


1140


AATTTAATCA 


AGAAAAGGCT CATGATTATG 


GTTTTATAGA 


AAATAGCGAA 


GTCrGGACAT 


1200


ATAGTTGCCA 


GATTTTGCAA GGTGACTTTG 


TCATGACTGT 


GTCCATCACT 


GCTGATAATG


1260


TGAACTTTCA 


AGTCTTTGAC CAAGAGACTG 


GTGACCTCTA 


TCCTCACGTT 


TATATGGAAA 


1320


GCATGAGGGG 


AAGTTTTGTC GCAAATGTCC 


GTGAGGCTTG 


TCTGGAGATT 


CTTTACCAGA 


1380


TTCGGAAGGC 


TTGTTTTGAT GTGCAAGATT 


TTATCTGTCA 


TCAGACTAAG 


CGTATCATCA 


1440


CTCAAGTTCA 


GGAAAAGTAT GGAAACCAGT 


TGGAGTATCT 


GTGGGAAAAA 


TCGCCTGATA 


1500


CAGCTGTATT 


GCGCCATGAA GGCAATCAAA AGTGGTATGC 


CGTCTTGATG 


AAAATCTCTT


1560


GGAATAAGCT 


GGAAAAGGGC AGAGAAGGAC 


AAGTGGAAGC 


AGTCAACCTC 


AAGCATGACC 


1620


AAGTAGCTAA 


TTTGCTTTCA CAAAAGGGGA 


TTTATCCAGC 


CTTCCATATG 


AGCAAGCGCT 


1680


ACTGGATTAG TGTGTCCCTT GATGATACTT TATCAGATGA AGAAGTACTG GAATTGATAG 


1740


AAAAAAGTTG 


GAACTTAACC TCTAAAAAAT 


GAAATATTTT 


AATAATTTTC 


ATGAACTTTC 


1800


AATTAGCTAA ATATTCTTTA CTGAAGAGAT TTTTAGAAAA TATAGGATTT ACCACACTAG 


1860


AGGAATATGG 


TGCCATCTTC AAATACCTGA 


TTGAGAATGT 


CAAGACGGAT 


CGTCAGATCA 


1920


TCTATTCGCC 


TCACTGTCAT GATGACCTCG 


GAATGGCAGT 


GGCAAATAGC 


CTTGCTGCTG 


1980


TCAAGAATGG 


TGCAGGACGT GTTGAAGGGA 


CTATCAATGG 


TATTAGGGAG 


CGAGCTGAAA 


2040


ATGCTGCTTT 


GGAAGAAATT GCAGTGGCTC 


TCAATATTCG 


CCAAGATTAC 


TACCAAGTAG 


2100


AAACCAGTAT 


TGTCCTAAAT GAGACCATCA ATACGTCAGA 


AATGGTTTCT 


CGCTTCTCTG 


2160 


GTATTCCAGT 


TCCTAAAAAC AAAGCCGTCG 


TTGGTGGCAA 


TACCTTCTCC 


CACGAATCTG 


2220 


GTATTCACCA 


AGATGGAGTC CTTAAAAATC 


CTCTCACTTA 


TGAGATCATC 


ACACCTGAAT 


2280 


TGGTTGGTGT TAAGATTCTG CTTGGAAAAT TATCTGGTCG CCATGCTTTT GTTGAGAAAC 


2340 


TGAGAGAATT GGCCCTAGAT TTTACAGAAG AGGATATCAA ACCACTCTTT GCTAAGTTCA 


2400 


AGGCACTGGT 


CGATAAGAAG CAAGAAATCA CAGATGCAGA TATTCGAGCT TTGGTAGCTG 


2460 


GAACCATGGT TGAAAATCCA GAAGGCTTCC ACTTTGATGA TTTACAACTT CAAACTCATG 


2520
CAGATAATGA CATTGAAGCG CTCGTTAGCC TAGCCAATAT GGATGGTGAG AAAGTCGAAT 2580

TTAATGCCAC AGGGCAAGGT TCCGTTGAAG CAATCTTTAA TGCTATCGAT AAcriirrrTA 2640

ACCAATCTGT TCGTTTGGTG TCCTACACTA TCGATGCGGT AACAGATGGA ATCGATACCC 2700

AGGATCGGGT TTTGGTCACT GTTGAAAACA GAGATACAGA AACCATCTTT AATGCAGCAG 2760

GGCTTGATTT TGATGTGTTG AAGGCTTCTG CTATTGTCTA TATAAACGCT AATACCTTTG 2820


TTCAAAAAGA GAATGCAGGT GAGATGGGAC GCAGTGTTTC TTACCACGAT ATGCCTAGTG 2880

TGTAAAGGAG AAGGCTATGG CAAAGAAAAT AGTAGCTCTA GCAGGAGACG GAATTGGCCC 2940

AGAAATCATG GAGGTTGGTT TAGAAGTTCT GGAGGCTCTA GCTGAAAAAA CAGGTTTTGA 3000

CTATGAGATT GACAGACGAC CGTTCGGAGG TGCAGATATT GATGCAGCAT GACCTCCCTT 3060

ACCTGATGAA ACCCTTAAGG CAAGTAGGGA AGCAGATGCT ATCCTACTAG TAGCTATCGG 3120


TAGTCCTCAG TATGATGGAG CAGTGGTTCG CCCTGAACAA GGCCTGATGG CTCTCCGTAA 3180

GGAACTCAAT CTTTACGCTA ATATTCGTCC TGTAAAAATC TTTGACAGTC TCAAGCATTT 3240

GTCACCACTC AAACTGGAAC GAATTGCTGG TGTAGACTTT GTCGTGGTGC GTGAATTGAC 3300

AGGCGGGATT TACTTTGGAT ATCATATTCT TGAAGAGCGC AATGCGCGTG ATATCAACGA 3360

CTATAGCTAT GAGGAAGTGG AGCGGATTAT TCGCAAAGCC TTTGAAATTG CAAGAAATCG 3420


CAGAAAAATC GTTACTACTA TCGATAAGCA AAATGTTCTA GCGACCTCAA AACTCTGGCG 3480

GAAAGTAGCT GAGGAAGTCG CACAGGATTT CCCAGATGTA ACCTTGGAAC ATCAGCTGGT 3540

AGACTCAGCT GCTATGCTTA TGATTACCAA TCCTGCTAAG TTTGATGTTA TTGTAACGGA 3600

GAATCTTTTT GGAGATATTT TATCTGATGA ATCAAGCGTC TTATCTGGTA CACTTGGGGT 3660

TATGCCATCA GCCAGTCATT CTGAAAATGG ACCAAGTCTC TATGAACCTA TTCACGGTTC 3720


AGCACCTGAT ATTGCAGGTC AAGGAATTGC CAATCCTATT TCCATGATTT TATCAGTTTC 3780

CATGATGTTG AGAGATAGTT TCGGACGTTA TGAGGATGCA GAGCGTATCA AACGTGCTGT 3840

TGAGACAAGT CTGGCGGCAG GAATTTTAAC GAGAGATATA GGAGGTCAGG 3900

GGAAATGACG GAAGCTATTA TTGCAAGGTT ATGAAGTTAG ACGAAAAAAT TACTCTAGTC 3960

CTTTTGATTT GGAATGTCAT CATTTTCTTG ATTTATGGTA TTCACAAATC TAAGGCAAGG 4020


AGAAGAGTTT GGCGCATCCC TGAGAAAATC TTACTTATTT TAGCCTTTAC TTTTGGTGGT 4080

TTTGGTGCCT GGCTAGCAGG AATCATCTTT CACCACAAGA CTCGAAAATG GTACTTTAAA 4140

ATAGTTTGGT TTCTTGGGAT GGTGACCACA CTAGTAGCCT TATATTTTAT TTGGAGGTAA 4200

TGGATGGCAG GGTCTTCGAG GGAATACGCT GCTTGGGCTC TAGCGGACTA TGGTTTTAAG 4260

GTCGTGATTG CAGGATCTTT CGGTGACATT CATTACAATA ATGAACTCAA TAATGGCATG 4320
TTGCCAATCG TTCAGCCTAG AGAGGTTAGA GAGAAACTAG CCCAGCTAAA ACCAACCGAC 4380

CAGGTAACTG TGGACTTGGA ACAACAAAAA ATCATCTCAC CAGTTGAAGA ATTCACCTTC 4440 

GAGATAGATA GCGAGTGGAA ACATAAACTC CTAAATAGTT TGGATGATAT CGGTATTACC 4500

TTGCAGTATG AAGAGTTGAT TGCTGCTTAT GAAAAACAAC GACCAGCCTA CTGCCAGGAT 4560

TAGAAAAAAT AGAAAAGGAG ATATAGTAAA CTGAAATAAG ATGTAAACAA ATGAATTGGA 4620

GCTTAACATC CATTTCCAGC AATTTTTTAG AAACTACAGT GGACTATTCT GGATTCAACA 4680 

CATTATAAAA TTATGACAAA ACACATTCAC AAGAAGGCTA CGACATTTTA AAAGGTGAGG 4740 

GCGGATGTAT CGTTTGCCCT ACTAAAGTTG GTTACATTAT CATGACCAGT GACAAGGCAG 4800 

GACTTGAGCG TAAGTTCGCA GCCAAAGAAC GTAAGCGTAA CAAACCAGGT GTTGTTCTCT 4860 

GCGGTAGCAT GGATGAACTT TGCGCTTTAG CGCAACTCAA CCCAGAAATT GAAGCATTCT 4920 

ACTAAAAACA TTGGGATGAA GATATTCTTC TTGGTTGTAT CCTTCCTTGG AAACCAGAAG 4980 

CCTTTGAAAA ACTCAAAGCA TACGGGGATG GCCGTGAAGA ACTTATTACT GATGTACGTG 5040 

GTACTAGCTG TTTTGTTATC AAGTTTGGAA AAGCAGGTGA ACAATTGGCT GCCAAGCTTT 5100 

GGGAAGAAGG TAAAATGGTC TACGCCTCAT CTGCTTCAAT GACAAAACGA TTGAAACTCG 5160 

CTATGAGCAA GGTGTAATGG TGTCTATGGT CGATAAGGAC GGCAAACTCA TCCCAGAACA 5220 

AGGAGGAGCA CGTTCAACTT CACCAGCTCC AGTTGTGATC CGTAAAGGGC TTGACATTGA 5280

TAAAATCATG ATGCACCTGT CAGATACTTT TAACTCATGG GACTACCGTC AGGTTGAGTA 5340 

TTATTAGGAT AGAGAAGAAG TCTAGTGTTA TGAGATATTA AAGCTCCTAA CACTGGGCTT 5400 

TTGTTTAGAA TTTCTTTTCT TTTTCTATAG GATATGGTAT TCTATGTAGA AAATATATGT 5460 

TAATAAGTAA TGCCAATATT TAAACATCAT TAGTAAAAGG AGTTAGATTG ATGAATAAAA 5520 

GAAAAGTTAG TTTAGAAGAT TTTTATAAAT GGTATAGTCT AAATAAAGAA GAGTTATTAA 5580 

ATAAGGCAAC TGTTGGTGAA AAGTTTAATG ATAAATTAAA AGAAGACTTT CTCCAGGAAT 5640 

GGCCTTTGGA TAGGATTTTA ACAATGTCAA TCGATGAATA TGTAATAGGA AAGGGACAGC 5700 

AAAATAAGTC TTTATGCTAC GCTCTTGAGA AGGGAAAATA CAAAAATCTA TTTCTTGGAA 5760 

TTTCTGGTGG CTCAGCTTCA AAATTTGGTA TTTATTGGAA TAAAAAAACA AACAAATATA 5820 

AAGATCAAGC TAATAATGAG ATTTCAGAGT TGGATCAGCG ATTTTCAAAA TTAAAATCAG 5880 

ATTTGTATGA AATTATCAAA GAACGTATTC GTTTTAACTT TGAAAATCCT ATTTTTGATA 5940 

TGAAAAGATC AACAAATGAA TTTATTGGTC GTTCTGCTAT GGTGACAAAA TTACTTTGTA 6000 

TCTATACTGA GGGAGATCCT TTCTTTGGTG TAAATATTAA TAGTCAGAAA GAATTTTGGA 6060 
ACCACTTTGT TTCTCAGACA AATCAAGGTG GACCTTATCT GCAAAATCAT AAAATAATTG

AGCCATCGAA ATTAGGAACT ATGCTTTTTG 



AACTGGTGTC CAAAACTTAT 
AGTATTCTAA GCTTTTTATG 
ATTTTCGTCA TCAATTAACT 
CTCCTGGCAC GGGAAAAACT 
ACGAAGATCA AATCGGATTT 
AAGGTTTAAG ACCAGTATCA 
TTTTTAAAGA TTTTTGTCAG 
TTGATGAGGC TTGGGATTCT 
TAACAAAAAC ATCTTACTTA 
GTGGTGTTCC AGGATGGTCA 
ATTATAATAA GCAAGAATAC 
AGAGATTTGG TTTGAAAGAC 
TCTTCATCAT CGATGAGATC 
TCTCTATCGA CCCCGGCTAT 
TACACGAAAC TGATGAAAAG 
ATGATATTGA TCGTTCAGTG 
TTGAAGTTAC TGTCGAGGGT 
AAGAAGCAAA AATTCGTCTA 
ACAGTCATTA TCATATTGGA 
ATGAATTACT CTGGTCTGAT 



ATGATGAGGT TGAAACTTTG 



CCTGAGTTGG 
GAAAATAAGG 
CAATCTCTAT 
TATCTTGCTA 
GTACAATTTC 
AATGGGGATG 
AAAGCAAAAG 
TACTTAGAAT 
TCTGTTAATA 
CTACCTAGCA 
TACAAAAGTG 
TATGTTTCCC 
AATCGTGGGG 
CGTGGTGAAA 
TTCTATATCC 
GATACCTTTG 
CAAGCTGGCA 
AGAAACTTGA 
CCAAGTTATT 
TATATTAAGC 
GAaACTTTGA 



AAGACAATAG TACAATGGAT TCATCAAACA 
TAAAGTCTCC AAACCTCATC CTCCGCGGTG 
AAGAAATTGC TAAAGAATTA ACGGATGGCA 
ACCCATCATA TGATTATACG GATTTTGTAG 
GAGCTATTGA GTTTAGGCTA CAGGACGGTA 
AAACCCAATT GATTGGAGGA CAAGATAATT 
ATATAAATGT TGCTGAAGAA AAAGAATATA 
GTAGACAAAA TTTGTCAGTA AATTATGATA 
AATATGTTTA CGAGTTGTAT AAAGATAAAA 
GTGGAAAAAC TGTCCTAGAA ACATTGAGAA 
CAACAGAAAT TGATACTGAT AAGAATTTTG



GATTTTTGGC GAACTCTTTT 
TTCTACCCAA TATGCAAATC 



AGATTTCTAA 
AAGGAAGTGT 
CCGAAAATGT TTACATCATC GGAACTATGA 



ATTTTGCTAT 
TGTTGGATAA 
ACGCTGCTAT 



GCGTCGTCGT TTTCGTTTTG 
AGAGTTGAAT ATCCATGCAG 
CGAAAATATT CAGGAATTAA 



AAAAAGATCA GGCAGTAGCT 



GATGACAATG 
GATAATCAAC ACAAGATTAT TAAAGAAAAA 
CCTCTTTTAG ACAGAACCTT GGAAAGTCTA 

TGATTTGGAT 

CAGAAAATCA AGACAGGGAA CGTGATTGGT 

TGAGAGTAAT 
TAGTTTAGAT 
TCCCAAGTAT 



TTCTTAAGTT GAAGGATGTA GATTTTGACT 
CTCTCCTAGA AGACTACTTG CGAGGTTCTT 
AAAAAGCATT TGATCTGACA AATAATGAGC 
AAGGCGATGA AAACGATGAT GCCGATTACT 
TTTGTTGAAG AATATCCTAA ACTAAGCAAT 



AATGATTwGA CTCATACTCC 



ATTTCCTCAC GATTTTCTGA 
GTTCTTCATA TCAATCTCAC 



CAACTTTTGG TGTATCTCTT



TCCCAAGATG 
AAGGACCAAA 
TTTCTTGGAT 
GACCACTTTT 
GTTGCTTTGT 
CTACAAGCTG 



AACGTATTTT CATTTTTCCA 
AGATTTTTGA AACAGTCAAT 
ATGGTCAGGA AAGATTAACG 
TGCATTATCT CTTAAACAAG 
CTCGTGAAGA GAGGCTTTAT 
CTATTCGAAA AGGTCTTTAT 



6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560

7620 

7680 

7740 

7800 

7860 
AAGGAATATC ATCGATTTTC TCATAACGAC AGTCATGTTA AGGGAGTGAT TGATGTAAGA 7920

AACCATCTCA AGAAAAATCT TCCTTTCACG GGAAATATTG CCTACGCAAC GAGAGAGTTC 7980 

ACCTATGATA ATCCCCTCAT GCAGTTGGTC CGTCACACTA TTGAATACAT TAAGAATCAG 8040 

AAAAGCATTG GTCAAGGGGT ACTAGATAAT CTCTCAACTA GTCGTGAAAA CGTATCTGAA 8100 

ATCCTGCGTG TAACGCCCTC TTATAAACTA GCTGATCGTG CTAAGATTAT TCGGGGAAAT 8160 

CAATCTAAAC CTATACGTCA TGCATACTTT CACGAGTACA GAAACTTACA AGAACTTTGT 8220 

CTGATGATCC TAAACCAAGA AAAGCACGGT TTAGGGTATC AAGATCAAAA AATCTATGGT 8280 

ATTCTCTTTG ATGTTGCCTG GCTTTGGGAA GAGTATGTTT ACACCTTGTT GCCAAAAGGT 8340 

TTTGTACATC CCAGAAATAA GGATAAGACG GATGGAATTT CAGTATTTTC TGTTGGGAAA 8400 

CGAAAAGTAT ATCCAGATTT TTATGACAGA GAACGAAAGA TTGTTCTAGA TGCAAAATAT 8460 

AAAAAACTGG AATTGACTGA AAAAGGAATC AACCGTGAGG ACTTATTCCA GCTGATTTCC 8520 

TATTCTTATA TTTTAAAAGC TGAGAAGGCT GGACTGATTT TTCCTAGTAT GGAGCAGTCA 8580 

GTAAATAGTG AAATAGGAAA AGTAGCTGGC TATGGAGCTC AATTGAAGAA GTGGTCTATT 8640 

CGAATCCCTC AGAATGCCTC ATTCTATAGT ACATTTTGTA AAATGATGGA AAATTCAGAA 8700 
GAG 8703

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4854 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

CATCACCAGT TTTAGATGGC TTTAACAGTG AAATTATTGC TTTTAATCTT TCTTGTTCGC 60 

CTAATTTAGA ACAAGTACAA ACAATGTTGG AACAGGCATT CAAAGAGAAG CACTACGAGA 120 

ATACGATTCT CCATAGTGAC CAAGGCTGGC AATATCAACA CGATTCTTAT CATCGGTTCC 180 

TAGAGAGTAA GGGAATTCAA GCATCCATGT CACGTAAGGG CAACAGCCAA GACAACGGTA 240 

GGATGGAATC TTTCTTTGGC ATTTTAAAAT CCGAAATGTT TTATGGCTAT GAGAAAACAT 300 

TTAAATCACT TAACCAATTG GAACAAGCCA TTATAGACTA TATTGATTAT TACAACAATA 360 

AGAAAATTAA GATAAAACTA AAAGGACTTA GTCCTGTGCA GTACAGAACT AAATCCTTTG 420 

GATAAATTAT TTGTCTAACT GTTTGGGGGC AGTACACAAG AAAGCGCTTT AAAACCAGTA 480 
GACCTTTTCA TAAGGTTCGC TTGATGTACC AAGATGAGGC TGGTTTCGGT AGAATCAGTA 540

AACTGGGATC TTGTTGGTCT CCAATAGGAG TAGGTCCACA TGTCCATAGT CACTATATAC 600 

GAGAATTTCG CTATTGTTAT GGAGCTGTTG ATGCCCATAC AGGCGAATCA TTTTTCTTAA 660 

TAGCTGGTGG ATGTAATACT GAGTGGATGA ACGCCTTTTT AGAAGAGCTT TCACAAGCTT 720 

ATCCAGATGA TTATCTTTTA CTCGTTATGG ACAATGCTAT ATGGCATAAA TCAAGTACCT 780 

TAAAGATTCC GACTAATATT GGTTTTACCT TTATTCCTCC ATACACACCA GAGATGAACC 840 

CATTGAACAA CTGTGGAAAG AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAC 900 

TTTGGAAGAT GTCATGAATC AACTCCAAGA TGTCATACAA GGATTGGAGA AGGAGGTGAT 960 

AAAGTCCATC GTTAATCGGA GATGGACTAG AATGCTTTTT GAAAACAGAT GAGTATAAAA 1020 

TTGAATTGCT TATAAAAAAG CTCCATACAC TGGATGTGTA TAGAGCAATG GGGCTTTATT 1080 

TGATATAGAG TTCTTGGTTT TTTAGGACAA TTTCTCGGAT ACTTGCAAAC TTTTTAAGTT 1140 

TTTTGATTTC TTCTGGATGA GTGACGAGAG TGATAACATA ACCTTCCTTG CCCATACGAC 1200 

CAGTACGGCC AGCACGGTCT GTGTAGGTTT CGCTATCTCT AGGAATATCA AAGTITACGA 1260 

CACATTCTAG GCTATCGATA TCAATTCCAC GAGCCAAAAG GTCAGTTGCA AGAAGCAGGG 1320 

TTAGTTGGTT ATCTTTAAAC TTTTCTAAGA TGATTTTTCT AAATTTAACA TTAACATCAC 1380 

TAGCGAGGGA AACAGCCAAT ATATCACGAT ACTGTAGTTT TTCCTCCGCA TTCCCAAGGT 1440 

CTGACAGGCT ATTGAAGAAG ACTAGACCAC GGAAATCCTC TACATGAGCC AGTTTTCGTA 1500

GCATATCCAC TCGATGACGT TGGTCTACCT GCATGTAGAA ATGCTGGATA TTCTCCAATT 1560 

TTTGATCAGA GAGATCAATA GTGCGTGTAT TCGGCACAAT CTTTTCTTGG TCAAACTTGG 1620 

TCGTGGCACT CATGTAGACC AGTTGGTGGT CACGAGGTCC GTAGTGAGTG ATTTTTTCTA 1680 

CAAAGTGAAT CTGAGAATCA TCTAGTAATT GGTCAAATTC ATCCAGGATG ATGGTTTCCA 1740 

CATTCATCAT CTTGATTTTT TTAAGTTTAA TGAGTTCAAA GATACGGCCA GGAGTTCCAA 1800 

TCAGAATTTC TGGCCCCTTT TTAAGACGTT CAATTTGTCG TTTCTGACTT GAACCTGAAA 1860

GGAAGAGTTG AGCAGTCAAT CCGATAGCTT CTGCCCACGT TTTACATACA TCAAAAATCT 1920 

GTCCAGCAAG TTCCGTATTT GGTGCTAGAA TCAAGAGTTG TTGGGCTTTT TTCTTTTCTA 1980 

GTCTGAGAAG ACTTGGTAGG AGATACGCTA GGGTCTTACC AGTTCCGGTT TGGCTCACTC 2040 

CTAGGAGGTT TTCTCCAGCA AGAAGGGGCT CAAATAGTTG AGTTTGAATG GGGGTGAATT 2100 

CTTGCAAACC GAGTTGGTCA CTCAGTTCTT GCCATTCAGT CGGTAGTTTG GTTTTCATTT 2160 

TTCTGCCTCA AATCTAATGC CAGCACTCTG GCGCATGGTA TATAGTAGCT CATGAACAGA 2220 

GCCTGCATCA TACAGCCAAC TTTGGTACAG ATTCAGATCT GGTTGCTGGA TCATGTGTGC 2280 
AAATGCAGCC ACTTCCTCAG TCATCGTATG AGGAGCCTGT TGGATAGGAA GCTGGACTTG 2340
ATTTCCTTGG TGGTCGGTAA AAATAGCTGA GCGAATATGC TCAATCGTGT TGAGAGTCAA 2400 
GGTTCCATCT GTTGTATAAA TCTCGCAAGG AAGATTGGAA GTGATGTTTT TTCCAGCCTT 2460 
GATGTGAACT TGATAGTCTG GGTAGAAGAG GATACCATCT CCATTTAGGT CAATGCTATT 2520 
GTCAAGCTGT TGAGCATGGT AAGTCGCGTC ATTGGCTTTT CCAAAAAGAC CAACACCAGC 2580 
ATAGAGGGGA TAAATCCCCA AATCCATGAG GGCTCCACCA GCAAAACGGT CTGAAAAGAC 2640 
ATTTGGTGTT TGTCCAGCCA ACAAGTCAGG CATCTTGGAA GAGTATTTGG CATAGTTGAA 2700 
ATCTGCTCCT AACACTTGCT TATCTGCTAA AAAGTTTTTG ATAGTAGTAA AGGCTTTCTC 2760 

GTGGTAATTA CGAGCTGCTT CAAAGATAAA ACAGTTATTT TTTTCAGCTG TTTGAATCAA 2820 

ATCAAACCAT TCTTGTGGTT GAGAGACAGC TGGCTTTTCG AGAATAACAT GTTTACCAGC 2880 

AGACAAGGCA GCTTTTGCCT GAGCAAAATG TAAGGAGTTT GGACTGGCGA TATAGACTAA 2940

ATCAAAAGAA GATTTGAAGA AGACTTCTAA TTGATCGAAT AGTTGGATAT TCTGATAGCG 3000 

AGAAGCAAAG GTTGCTGCAG TTTCTAGTTT TCTAGAATAG ATTGCGACCA GTTGGTATTC 3060 

TCCACTGGTA TGGGCTGCTT CTATGAAATG ATGGCTGATA GCGCCAGTTC CGATGACACC 3120 

TAATTTTAGC ATAAATACTC CTTTTCCGAT TTTAAATCCT TCTTTCATTA TAACATAGAT 3180 

AGACGGGACT ATCCAACAGA GAGGAGAAAA TTTCAAATAA GCTATTAGCT TTCTTTTCCG 3240 

AATAAATAGA TAGAAGCATA GAATCTAGCA AACCTAGATT TAAAAATGTG CTATAATAGA 3300 

AGGAGGAAAA GGAGGATTCT CAGACATCTA GGTATCAGCC CAACTAATCA TTTGTCAATT 3360

TATCCGCGAT ATGCTGGACT TGCCAGCAAA AAATGTGACG ATTTTGGAGG GAAGTAACAT 3420 

TCACGTCTTG CCTTCCATGC CCTACTCACC GTAAGATTTC TATACTAGTA TAGACGTCTT 3480 

GGCGGAGTTA GATAATGGAA TCCAAGTTAT CATCGAAATT CAGGTTCATC ATCAGAATTT 3540 

TTTCATCAAT CGCCTATGGC CTTATCTGTG CAGTCAGGTT AATCAAAACC TAGAAAAAAT 3600 

TCGCCAACGT GAAGGTGATA CCCACCAGAG CTACAAACAA ATCGCACTAG TATACGCTAT 3660 

CGCAATTGTC GATAGTAATT ACTTCTCAGA TGACCTAGCT TTTCATAGTT TTATAGTAAA 3720 

ATGAAATGAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA CAATCTTTTA 3780 

GAAGTATAGG TCTACTATTC TACCTTCAAT CTACTAGAAA TTCCATAGAT AGAAAACTAC 3840 

ATAATCTCTA CAGATACGGA TGTTGGAGTT GATGTAAGAT GCTTTGGCTT GCTAGAGGAA 3900 

TTGTCGATTG CCAAATTGTA TCATTGAAAT TATTGCTCAA ATTTGTTATG ATATAAATAT 3960 

GAATAAAAGT AGACTAGGAC GTGGCAGACA CGGGAAAACG AGACATGTAT TATTGGCTTT 4020 
GATTGGTATT TTACCAATTT CTATTTGCCT ATTAGGCGCA TTTATTGCTT TTAAGATCTA 4080
CCAGCAAAAA AGTTTTGAGC AAAAGATTGA ATCGCTCAAA AAAGAGAAAG ATGATCAATT 4140
GAGTGAGGGA AATCAGAAGG AGCATTTTCG TCAGGGGCAA GCCGAAGTGA TTGCCTATTA 4200
TCCTCTCCAA GGGGAGAAAG TGATTTCCTC TGTTACGGAG CTGATAAATC AAGATGTTAA 4260
GGACAAGCTA GAAAGTAAGG ACAATCTTGT TTTCTACTAT ACAGAGCAAG AAGAGTCAGG 4320
TTTAAAGGGA GTCGTTAATC GTAATGTGAC CAAACAAATC TATGATTTAG TTGCTTTTAA 4380
GATTGAAGAG ACTGAAAAGA CCAGTCTAGG AAAGGTTCAC TTAACAGAAG ATGGGCAACC 4440
TTTTACACTT GACCAACTGT TTTCAGATGC TAGTAAGGCT AAGGAACAGC TGATAAAAGA 4500
GTTGACCTCC TTCATAGAGG ATAAAAAAAT AGAGCAAGAC CAGAGTGAGC AGATTGTAAA 4560
AAACTTCTCT GACCAAGACT TGTCTGCATG GAATTTTGAT TACAAGGATA GTCAGATTAT 4620
CCTTTATCCA AGTCCTGTGG TTGAAAATTT AGAAGAGATA CCCTTGCCAG TATCTGCTTT 4680
CTTTGATGTT ATCCAATCTT CGTACTTACT CGAAAAAGAT GCGGCCTTGT ACCAATCTTA 4740
CTTTGATAAG AAACATCAAA AAGTTGTCGC TCTAACCTTT GATGATGGTC CAAATCCAGC 4800
AACGACCCCG CAGGTATTAG AGACCCTAGC TAAATATGAT ATTACAACCG GGGT 4854

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2186 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:
TAAACAGGTG TTAGGTGCTC TAAACTATTA AAATTCTAAG GAAATAAGGC TACTTTTTCT 60
GGGTCTTGTT CATAGTAGGT GTGGTTCTTT TTTTCGAGTG TAGCCCATAG CTTTGAGCGC 120
ATAGTGGATG GTAGTTGGAT GACAGCCAAA TTCAGAAGCT ATTTCAGTCA AATAAGCA7C 180
TGGATTGTCA GTAAGATAGT TTTTAAGTCT ATCTCTATCA ACTTTTCTTG GTTTTGTTCC 240
TTTTACTTGG TGGTTTAGCT CTCCTGTTTT CTCTTTTAGC TTTAACCAGC CATAAATGGT 300
ATTACGTGAG ATTTGGAAAA CGTGTGATGC TTCTCTTATA CTACCTGTTC GCTCACAATA 360
AGAGACAACT TTTTTACGAA AATCTATTGA ATATGCCATA AGAAGATTAT ACCACATTGT 420
GTACTATTTT TGGTTCATTT TACTATATTT CTAAACACTT AGAAATAATA AAACAAATTA 480
AATATTATTT CTAAATATTT GAAAATAACA TCTATTTGTA TTATACTATC TTTGAGGTAA 540
CTATTATGAA CTATATCAAA AGACCACATT ATTTAGATTT TTTAAGAAAA CATCGTGACC 600
GACCAATCAT CAAAGTTGTG ACTGGAGTTA GACGAGCTGG TAAATCTGTG CTTTTTCAAC 660

TCTATAAAGA GGAGTTACTA GCAACTGGGG TAGACGAGGA TCAGATTATA TTCATCAATT 720

TCGAAGATTT GAGTTACTAT GATCTGCGAC ATTTTCAAAC ATTATTCGCT TATATAAAAG 780

ATCAATTAGT TAGCAAGAAA ACATACTATA TCTTTTTAGA TGAAATTCAA TATGTTGAAA 840

AATTTGAACT GGTAGCAGAT AGTCTATTCA TCTTAGCAAA TGTAGACCTC TATTTGACTG 900


GATCTAACGC CTACTTTATG AGTAGCCAAT TAGCAACAAA CTTGACTGGT CGGTATGTTG 960

AGATAGAGGT TCTTCCTTTG TCATTTGAAG AATATCTATC AGGTCAATCT CTCACAGAGA 1020

ATCTGAATAC AACAGAAATT TTTAACAATT ATCTCTTTAG TGCTTTCCCT TACTTATTGC 1080

AAACATCATC TTACGATGAA AAAATTGACT ATCTCAGAGG AATATATAAC TCCATACTGT 1140

TAAATGATAT TGTCACTAGA TTGGGAAAAC CAAATCCTAC TATTATTGAG CGCATTGTCC 1200


GAACCCTTCT CAGTAGTACA GGTAGCTTAA TATCAACAAA TAAGATTCGC AATACCCTAG 1260

TCAGCCAAAA TGTTTCAATA TCCCATAATA CTTTGGAAAA TTATTTCACA ACTTTGACAC 1320

ATAGTTTACT TTTTTATTCC GTTCCACGTT TTGATGTAAA AGGTAGAGCA TTATTGCAAC 1380

GTTTAGAAAA ATATTATCCC GTTGATTTAG GTTTACGACA TCTCTTATTA CCAGACCAGA 1440

AAGAAGACAT TAGGCATATC TTGGAAAATA TGGTATATTT GGAATTGAGA CGTAGATATT 1500


CACAAGTATA TGTTGGTAAT TTAGATAAGT ATGAGGTTGA TTTTGTTGTT GTAACTGATC 1560

TTGGCCACTA CGCTTATTAT CAGGTCAGTG AAACAACACT TGCTCCAGAA ACACTAGAAA 1620

GAGAACTTAG ACCACTAGAA GCCATTAAAG ATCAATTCCC TAAATATCTA TTAACAATGG 1680

ATACGATTCA GCCAACAGCC AATTACAATG GAATCGAGAA GAAAAGCATT ATAGATTGGT 1740

TACTAGAAAA ATAGATAAAT ATAAATCATA CAGCTAATTA GATTTGCAAC AGTCTGTTAT 1800


CAATGATTCT ACCCAAATCC TAACAAGATA TAGTGAATTT CGAATACGCT ATATAATACG 1860

GACACTTGAA AATAGAAATT GGGGATGAAA GGGGATCTAT AATTTCTGGA AGTACTATCA 1920

AAAATTAATA TCATAGTCTT ATTAGAGAAT ACCATCACCC ACTTTCTCAA ATAAGATTAA 1980

ATTGTAACTG AATTATAATG AAAAAGAGAC TGAGCAATCA GTCTTTAAAA TCAGAAAAGC 2040

GCATAGTATC AGGTATTGAA CAACCTTGAT AATATGCGTT TTATTATGGA AATATTTGCT 2100

TCATTTTCTC CTGAAATAGA GCTTTTGCTA TCCTATTTTT CTCTATTTCT AATGATTTAC 2160

TTCAACTTCT TACCTCTTGG GAAAAA 2186


(2) INFORMATION FOR SEQ ID NO: 180: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3236 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:






GTCACACGTT TGACTTCACG TATTTCATAA GTATAAACTT TATTTTTATC GGTTAGATAA 60

ATCTTCATGC CATTTTTAGC ATTATCTAAA GGAGAAAATA ACATTTTATT AGCATTATCA 120

ACACCAAAGA TATGGTGACT AGCTAGACTA TAATTTCCTT CTCCCATTAC TTGCTCGCGT 180

TTCATTGTAC CAGCTCCGTA GAAGAGATTA ACATTATCAA GTCCTTTAAA AATCGGCAAA 240

TTCATTTCCA ATTCAGGAAT TGCAATTCCC CCAATAACTG GTAATTTTTG AGCATCCCAT 300


TGAGAAGTTA GAACAGCTTC CGAAGAGATA GCTTTGACAG AATCAAAGTC AAAATTCCCT 360

TCTGTATCCT GATTTTCTTC TAATTTTTCT TTTGATACCT GGCTAACTTG ATACTTATTG 420

GTATTCCAGA CTATGAAAAT ATTTCGAATT TGAGTATTAA AAATCAAAGC CAGTGACAGT 480

AATATCAGAA ATCCTGCTAG GATATTTGTC AGCAGATTTT TTCGCTTGTT TTTCTTTTTA 540

TTATTTTTTT GAGACATTAT GCTTCACCTT CTGTTTCCTT TTCTGTCCCA ACTTCTTCTT 600


TTTCTGCCAC CGCAACCGTT GTGAAAGTCA CTATCTGAGC ATCTTGATCC AGGCGCATTA 660

CTTTAACTCC CATAGTTGCA CGTCCTGTTT GTGAAATATT GGCAAGATTG GTTCGAATCA 720

TGACACCTGT ATCAGTGATA ATCATCAAAT CCTCATCCCC TTGAACAGTC ATAAGACCGG 780

CCAGCAAGCC ATTTTTTTCG GTAATTTTAG CTGTCTGCAT TCCCTTACCA CCACGACCTT 840

TTGTTGGGTA TTCAGTAGCG ACTGTACGCT TACCATATCC TTTTTCTGTG ATAATAAGAA 900




ATCAGTAATC AAGCTGGCAC CAACAACTGT GTCTCCTTCA CGAAGGTTAA 960

CACCTTTCAC ACCAGTGGCG ATACGGCTCA TACCACGAAC GGCTGATTGA TTAAAGCGAA 1020

CTGCATAACC AAACTTGGTA CCAATGATAA TATCCATATC TCCTTCTGCC AACAAGACAT 1080

TGATTAACTC ATCTTCATCC TTTAAATTCA GCGCTTTGAG ACCATTTTGA CGAATATTGG 1140

CAAACTCCTT AACACTGGTT CTCTTCACAA TACCGTGACG GGTTGTAAAG AAGAGATAAG 1200


CATCATCACT GCGATCAGAC TCAACATTGA TAACCGTCTG AATACTTTCG TCTTCATCCA 1260

ATTTCAAGAG ATTGACTACT GGTAGCCCTT TGGCAGTCCG ACCATACTCA GGAATTTCAT 1320

AACCTTTAAG ACGATAGACA CGTCCCTTGT TTGTGAAGAA GAGCAGATGA TCATGGGTGC 1380

TAGTTGACAC TAACTCACGA ACAAAGTCAT CATCTTTCAC TCCCGTTCCT TGGACACCAC 1440

GACCCCCACG TTTTTGAGCA GTGAACTCGT CCTGATCCAA ACGCTTAATG TAGCCTCTGT 1500

TAGAAAGGGT AATCAAGACA TCCGATTCTT CAATCAAGTC CTCATCCTCG AGACTCAAGA 1560
CCTGTCCAAT CATCAACTCT GTACGGCGCT TATCAGAAAA TTTACGTTTA ACTTCATCCA 1620

ATTCGTCTTT GATAATTTGA GAAACACGTT CAGGCTTAGC AAGAATATCT GCTAAATCCG 1680 

CAATCAGAGC CAAGAGGTCA TCATACTCAG ATTGAATCTT ATCGCGTTCC AAACCTGTCA 1740 

AACGACGAAG ACGCATATCA AGGATAGCTT GACTTTGACG TTCAGAAAGC TTAAACTTGC 1800 

TCATCAACTC AGCTTGAGcT TCCGCATcCG tTTCACTACC ACGGATGATA CGAATCAyTC 1860

GTCCATATGG TCTAGCGCAA TCAAGAGACC TTCTAAGATA TGAGCGCGCG CTTCCGCTTT 1920 

TTCCTTATCA AAACGTGTAC GACGAACAAC CACTTCTTTT TGGTGCTCGA TATAAGCATC 1980 

CAAAATCTGA CGAAGAGACA AAATTTTCGG TATACCATTT TGGATAGCGA GCATATTGAA 2040 

ACCAAAATTG GTTTGCATTT GGGTCATTTT GAAGAGGTTA TTGAGAATAA CATTGGCTGA 2100 

GGCGTCGCGC TTGACTTCAA TAACAAATCG AACACCTTCA CGGTTTGACT CATCACGTAC 2160 

TCCTGTGATA CCCTCAATGC GTTTTTCCTG AACCAAGCGA ACAATATGCT CATGCACCTT 2220 

GGTTTTATTG ACCATGTAAG GAAATTCTGT TACAACGATA CGCTCACGAC CAGTCTTAGT 2280 

CGTTTCAATC TCTGTACGAG AACGTAGGAC AATCGAACCT TTACCTGTTT CATAAGCCTT 2340 

ATGGATACCT GATTTCCCCA TGACAAGAGC ACCAGTTGGA AAATCTGGTC CAGGCAAGAC 2400 

TTCCATCAAG TCCTTGGTAG TCACTTCAGG ATTATCCATG ACCAACTTCA CTGCATCAAT 2460 

gGTTTCACCC AGATTATGAG GTGGAATATT GGTTGCCATC CCAACCGCGA TACCAGTTGC 2520 

TCCATTAACC AAAAGGTTTG GAAAACGCGC TGGCAAGACC AAGGGTTCCC GTTCATTGGC 2580 

ATCATAGTTA TCAACGAAAT CAACTGTATT TTTGTTGATA TCACGAAGCA TTTCCAGAGC 2640 

AATCTTGCTC ATACGTGCCT CGGTATAACG TTGAGCCCCA GCACTATCTC CATCCATGGA 2700

ACCAAAATTC CCATGACCAT CTACAAGCAT GTAACCGTAG CTCCACCATT GAGCCATACG 2760 

GACCATGGCT TCATAAATAG AGGAATCCCC GTGTGGGTGA TATTTACCCA TGACATCCCC 2820 

TGTAATACGA GCAGATTTTT TATGGGGTTT GTCTGGGGTC ACACCCAATT CATTCATTCC 2880 

GTAGAGAATG CGACGGTGAA CAGGTTTTAA GCCATCTCGA ACATCAGGAA GAGCTCGCGC 2940 

TACGATAACA CTCATGGCGT AGTCGATAAA ACTTGCCTTC ATCTCCTTTG TCAGATTGAC 3000 

ATTCACTAAA TTTTTATCCT GCATTAATAA ATGCCTCATT TCACAATTAG TAAGTAACAA 3060 

CATTATACCA TAAATTCCCA TCTATTTCAG CCTCTAAACC ACTAAAACGT TTACATCGAG 3120 

AACTATAAGG CATATTCGTG ACAAAGTTTT TTAAAAGTGA TACAATGAAG TTGTCTAGGG 3180 

AAAACCCCTA ATAGAATAAG GAGATGGTTA nACAATGACT CTGACTAACA CACAAA 3236 
(2) INFORMATION FOR SEQ ID NO: 181:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8651 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:



AGGTCCTGAA GTATTGGAAC AGGAAGGTCA AGAGTTTTTG GAACATTTCA AAAAACTCTT 60

GGAGTCAGTT GAAGTAGTAG CCATCTCAGG TAGTCTGCCA GCTGCCCTTC CAGTTGATTA 120

CTATGCGAGC TTGGTAGAAC TTGCTAATCA AGCTGGCAAG CATGTAGTCT TGGACTGCTC 180

AGGTGCAGCA CTTCAGGCTG TTCTTGAATC ACCCCATAAA CCAACAGTCA TCAAACCAAA 240

TAATGAAGAA TTGTCTCAGC TTCTTGGAAG AGAAGTTTCT GAGGATTTGG ATGAATTAAA 300

AGAAGTACTT CAAGAACCTT TGTTTGCAGG GATTGAATGG ATTATCGTTT CACTTGGTGC 360

CAACGGTACT TTTGCCAAAC ATGGTGACAC TTTCTACAAG GTAGATATTC CTAGAATTCA 420


GGTGGTAAAT CCTGTTGGAT CTGGAGACTC TACTGTGGCA GGAATTTCTT CAGGACTTCT 480

TCACAAAGAA TCGGATGCAG AATTACTCAT CAAGGCAAAT GTCCTTGGTA TGCTCAATGC 540

TCAAGAAAAA ATGACTGGTC ATGTCAACAT GGCCAACTAT CAAGCTCTAT ATGATCAATT 600

AATAGTAAAA GAGGTATAAA ATGGCTTTAA CAGAACAAAA AOGTGTACGC TTAGAAAAAC 660

TTTCTGATGA AAATGGTATC ATCTCAGCTC TTGCATTTGA CCAACGTGGT GCTTTGAAAC 720


GCCTCATGCT TAAACACCAA ACAGAAGAAC CAACTGTGGC CCAAATGGAA GAACTTAAAG 780

TCTTGGTAGC AGATGAATTG ACTAAATATG CTTCATCTAT GCTTCTTGAC CCTGAGTATG 840

GACTTCCAGC AACTAAAGCT CTTGATGAAA AAGCTGGTCT TCTCCTTGCT TATGAAAAAA 900

CAGGTTATGA CACAACAAGC ACAAAACGCT TGCCAGACTG CTTGGATGTT TGGTCTGCAA 960

AACGTATTAA AGAAGAAGGT GCAGATGCAG TTAAATTCTT GCTTTACTAT GATGTAGATA 1020

GCTCAGACGA ACTCAATCAA GAAAAACAAG CCTACATCGA ACGCATCGGT TCTGAGTG*J*G 1080


TGGCTGAAGA TATCCCATTC TTCCTTGAAA TCCTTGCTTA CGATGAAAAA ATTGCGGATG 1140

CAGGTTCTGT AGAATACGCT AAAGTAAAAC CACACAAAGT TATCGGCGCT ATGAAAGTCT 1200

TTTCAGACCC ACGCTTTAAC ATTGATGTTT TGAAAGTTGA AGTTCCTGTT AACATTAAAT 1260

ATGTTGAAGc IcTCGCTGAAG GTGAAGTAGT TTATACACGT GAAGAAGCAG CAGCCTTCTT 1320

CAAAGCGCAA GATGAAGCAA CGAACTTGCC ATACATCTAC TTGAGTGCTG GTGTATCAGC 1380

TAAACTCTTC CAAGATACTC TTGTATTTGC TCATGAATCA GGTGCGAACT TTAACGGAGT 1440

TCTTTGTGGC CGTGCTACAT GGGCAGGATC AGTTGAAGCT TACATCAAAG ATGGTGAAGC 1500



WO 98/18931 



PCT/US97/19588 



AGCAGCTCGC 
TTCAAAGAAC 
CATGAATCTA 
TGCTATACTT 
AATAGGATAT 
ATAAACTATA 
TACTAAAAAG 
AGCCTTACCT 
AGCAAATGAT 
AGATATTGTA 
TATAAATTTA 
AGCAAGTTTG 
AGGTTTTCGT 
TTTCGCTACT 
TGGATTTGTT 
TTATTGGACA 
AGATTTACCT 
GTTTTATAGT 
CTTTTTTATT 
GCCTCTGTTT 
TTACAAAATA 
TAAGTAATAT 
GTGGTCATTT 
TTCTAAATCC 
TTGTAATTGG 
AGGTTGAGAA 
TCAAAGATGT 
AGAAATATCA 
GTTCTTATCA 




GAATGGtCGC ACAACTGGAT TTGAAAACAT TGACGAACTC AACAAAGTTC 
AGCAACTTCA TGGAAAGAAC GCGTCTAAGA AAGTCCTCCT AGTTTAGGAA 
AAAAAATTTA AAAAAAGTTG TATGTAAAGG CTTACAAAAT AACTTACTTG 



AAATCACAAG 
AGTTTATTTT 
TTTAGATGCA 
AGTTTTAAAG 
ATGGTTAGTG 
ATTGTTTTGA 
AATGCTGGTC 
AAAGGCACTC 
ATGTATGGAG 
TTCAATCTTG 
GGATATGCTG 
GCTGTAATAA 
GTTGCTGTAG 
TAGAGGTTAT 
TTCTTGTGTT 
TTTCTGTCTT 
GATGAATTTT 
ACTTACTTGT 
TAGGCATGAT 
TTTGTACTTA 
AATGAATCAC 
TAAGGGAATT 
AATCTTTCGG 
TCCGCTTGAT 
TTATCCGATT 
AGCTCTAACT 



TTAATATGAA TTAGAAAGTA ACTATATGAA GTATAATAAA 
ACGAGCTAGG AACGAAAAAT ACGGAAACAA TATTGCCAGA 



CATTTCATTC ATTGTTTTAT 
CGTTAGTTGT AGGACTAGGT 
GTTCTGTATT TGCAGATAGT 
ATGTTGATGG GAATAAATTT 



AAAAGGAGAA GATAAACGGC 
ATTGTTTCAA TATTCTTATC 
GCCCTAACTA CAGTAGATAA 
TATAATGTTT CGGTTTCAGA 



AAATTTTGGA AGATTATTTT 



CTGAAGAGTT AGCAAAAAAT 
CTGTAAAAGA GTTACCCAAC 
GTCCTCAAGT GAGGGGGATG 
GATGGCATTT GAAACAATTT 
GTGGTGCGAT TGGCTGGGCT 
CTACAGTAGA AGTGCCGTTT 
TTCTTTATGA ATCATTCTTT 
CTTTGTTTAT TAGACTTAAT 
CCTGTTTTTG TTTTGATTTA 
AGAACATAGT TAAGTTTTAA 
GCTATACTTA AATCACAAGT 
CACAGGTGAA TTAGAAATCA 
TATACCTTTA AGATATAAAA 



AATGTCTCGC 
GCATTCGGAA 
ATGAAGACCG 
TTTATCACAG 
CAAGAGTATC 
CAAGGAAGGT 



TTGTCAGAAA 
AGAAGAAGGG 
AAGAGTCCAG 
TGACCTATGA 
TCTATGTAAC 
ACAAGGATAG 



TATGTAGATA 
ATTGGTATTT 
GTTTACGAAA 
GGTGGCTGGG 
GCGGTTAATC 
GTAAAAACTG 
GTGAACCTTG 
TAAAAAAATA 
GAATTTTAAA 
CAAAAATAAA 
AAAAAGTTGT 
TAATACAAGG 
GCTGATTTTC 
GGAGGTTGAC 
TGATAAGGGA 



AATTTGGAAA 
CTGTACAAGA 
GAGGTCCTGT 
CTGCTGGAGC 
CTGTTACATC 
CTGTAGAAAA 
TTTACACCAT 
ACTGTATTTT 
AATGTAGCTA 
TAAAAACAGA 
ATGTAAAGGT 
TGAGTGTTAC 
TAGTTCATTT 
ATGTATCGAA 
GAAGAGGTGA 



GGATTTGATT GCTGAAAATC 
AGAAAACTTT ATGGCTCTTC 
AATCATTGAT AAGCTATCAA 
CTTCACAGAT CATATTTACT 
TAATCTGCCA GATATTTCCG 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 
CTAAGTATCC TGTCGCTTTT CAAATCCCAA ATGAAGCTTT TGAAATTTAC CGTCAGAAGC
TAGCAGATCA TTTTCCTGAG GACGAAATTA TTCGGATTGC TTATCATTTC ATTAATGCTG 
AAGGTGAAAA TGAAGTGGAA CTTGTGGAGT CGATTGATAA GAGGAAAGAA ATTCTCAGGA 
ATGTTGAAGA AGTTTTAACG GACTATGCAA TTCAACGAAC TAAAAAGAAT AACCATTTCT 
ATGATCGCTT TATGATCCAT TTGAATTATT TCTTGGATTA TTTAGACAGA TCTAGAGATG 
ATAACCAATC ACTTCTGGAT ATGGAAGATC ATATTAAACA ATCCTATCCA AAAGCCTTCG 
AGATTGGTTC CAAGATCTAT GATGTGATTA CGCAACATAC GGGTCTTGAT TTGTATAAAA 
GTGAACGAGT TTATCTAGTT CTACATATCC AACGTTTATT GTCATAAAAA TTTATTTAAA 
ACTATATAAG GAGAATTCTA TCATGAATAG AGAAGAAGTA ACATTGTTAG GTTTTGAAAT 
CGTAGCCTAT GCTGGCGATG CTCGTTCAAA ACTATTGGAA GCCTTGAAGG CTGCTGAAGC 
TGGTGATTTT GAAAAAGCGG ACGCTCTGGT AGAGGAAGCT GGTAGCTGTA TTGCAGAGGC 
TCACCACGCG CAAACAAGTC TATTGACTAA GGAAGCTTCA GGTGAGGACT TGGCTTATAG 
TGTAACCATG ATGCATGGCC AAGACCACTT AATGACAACT ATCTTGTTAA AAGATTTGAT 
GCATCATTTA ATTGAACTCT ACAAGAGAGG AGTTCAATAA TGAATAAACT AATTGCATTT 
ATCGAGAAAG GAAAGCCTTT CTTTGAAAAA CTATCTCGTA ATATCTATCT TCGTGCTATT 
CGTGATGGTT TCATTGCAGG TATGCCTGTT ATTCTCTTCT CAAGTATCTT TATCTTGATT 
GCCTTTGTAC CAAACTCATG GGGCTTTAAA TGGTCTGATG AAGTTGTAGC CTTTCTGATG 
AAACCTTATA GCTATTCTAT GGGTATTCPG GCTCTCTTGG TAGCTGGTAC AACAGCTAAG 



TCATTGACTG 

ACATTGTTGG 

CTAGCTACTG 

ACTGTAGCCA 

GTTCCACCAA 

TCTCTTTATG 

ATCGGTAAAT 

ATCTTTGGTG 

CCAGCTATCG 

GGGATGCATG 

GGTACAGGTG 

AACCGTGCAA 



ACTCAGTAAA CCGGAGCATG GAAAAAACCA ATCAAATCAA GTATATGTCA 
CAGCAATTGT TGGTTTGTTG ATGTTGGCAG CTGATCCTAT CGAAAGTGGT 
GATTCTTGGG GACAAAAGGT TTGCTTTCAG CCTTCCTTGC TGCCTTTGTT 
TCTATAAGGT TTGTGTTAAG AACAACGTCA CTATTCGTAT GCCTGACGAA 
ATATCTCACA AGTCTTTAAA GATGTGATTC CATTCACTCT ATCTGTTGTT 



CTCTTGACTT ATTAGCACGT TATTTTGTTG 
TCTTCGCACC ACTCTTCTCA GCAGCAGACG 



GTTCTAGTGT GGCAGAATCA 
GATACCTTGG TATTACCATT 
CCTTTGCCTT CTTCTGGTTT GTTGGGATTC ATGGTCCATC TATCGTTGAA 

ACTTGAACCT TCTCCAACAA 



CAGCTATTAC CTATGCCAAT GCCGAAGTTA 
CAGACAAGAT TCTTACTTCT GGTACACAAA TGTTTATCGT TACCATGGGT 



CGACATTGGT CGTTCCATTT ATGTTCATGT 
TCGGACGTGC TTCAGTAGTT CCTACCTTCT 



GGTTGACAAA ATCGAAACGT 
TCGGTGTAAA TGAACCAATC 



3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 
TTGTTTCGTG CACCTCTTCT TTTGAATCCA ATCTTCTTCA TTCCATTTAT CTTTGCTCCA 5100

ATTGCAAACG TATGGATTTT CAAATTCTTT ATTGAAACTC TTGGAATGAA CTCATTCACT 5160 

GCTAATCTAC CATGGACAAC TCCAGCTCCA CTAGGTCTAG TTCTTGGAAC TAACTTCCAA 5220 

GTGCTATCAT TCATTCTTGC TGCCCTTCTA ATCGTGGTTG ACGTTGTCAT TTACTATCCA 5280 

TTCCTTAAGG TCTATCATGA ACAAATTCTT GAAGAAGAAC GTTCAGGTAA GTCTAATGAT 5340 

GAATTGAAAG AAAAAGTTGC TGCAAACTTC AACACTGCAA AAGCGGATGC TATTCTTGAA 5400 

AAACCGGGTG TCGATGCAGC ACAAAATACC ATCACTGAAG AAACAAATGT CCTCGTTCTC 5460 

TGTGCAGGTG GAGGAACAAG TGGTCTCCTT GCAAATGCTT TGAATAAGGC AGCAGCAGAA 5520 

TACAATGTCC CTGTGAAAGC AGCAGCAGGC GCCTATGGTG CTCACCGTGA AATGTTACCA 5580 

GAGTTTGATC TTGTTATCCT TGCCCCTCAA GTTGCTTCAA ACTTTGAAGA TATGAAAGCA 5640 

GAAACAGATA AGCTCGGTAT TAAACTAGCG AAAACAGAAG GCGCTCAATA CATCAAATTA 5700 

ACTCGTGATG GAAAAGGTGC TCTTGCATTC GTACAACCGC AATTCGATTA AGGCTAGAGA 5760 

CTCTGAAATA GTCTCCCATC GTTACCGAAA TCGCTATGGC GAATTTCCTA TTATTAATTC 5820 

GTCGGTAAAA AGATATCGTT TTTACCTCCT CATGTCACAA TTCGGTGACT TGGTACAAGA 5880 

AGTGAGATGG AGAAGGATGG CTCACTGACT CCTCTCCTCT CACTTTTACT TTATTTAAAT 5940 

CAAGAAATAG GTGAAAAAAA TGACAAAAAC ACTTCCAAAA GACTTTATTT TTGGTGGCGC 6000 

AACAGCTGCT TATCAAGCAG AAGGTGCTAC ACATACTGAT GGAAAAGGAC CAGTTGCTTG 6060 

GGATAAATAT CTTGAGGATA ACTACTGGTA CACTGCCGAA CCAGCTAGTG ATTTTTACAA 6120 

TCGATATCCA GTTGACCTCA AGCTAGCAGA AGAGTATGGT GTCAATGGTA TTCGAATTTC 6180 

TATTGCTTGG TCACGTATTT TCCCGACTGG TTACGGCCAA GTAAATGCTA AAGGTGTTGA 6240 

GTTTTATCAT AATTTATTTG CAGAGTGTCA CAAACGTCAT GTTGAGCCTT TTGTAACTCT 6300 

TCATCACTTT GACACGCCAG AAGCTCTCCA CTCAAATGGA GACTTCTTAA ACCGTGAAAA 6360 

TATCGAACAT TTTGTAGACT ACGCTGCCTT CTGTTTTGAA GAATTTCCAG AAGTAAACTA 6420 

TTGGACAACC TTTAATGAAA TTGGACCAAT CGGTGATGGT CAATATTTGG TTGGGAAATT 6480 

CCCTCCAGCT ATCCAGTACG ACCTTGCCAA AGTCTTTCAA TCACACCACA ATATGATGGT 6540 

GTCTCATGCA CGCGCGGTAA AATTGTACAA AGAGAAACGC TATAAAGGGG AAATTGGTGT 6600 

TGTTCACGCC CTGCCAACTA AATATCCTCT AGATCCTGAA AATCCAGCAG ATCTTCGTGC 6660 

AGCTGAGTTG GAAGATATCA TCCACAATAA ATTCATCTTA GACGCAACTT ATCTAGGTCG 6720 

CTATTCAGCT GAAACCATGG AAGGTGTCAA CCATATCTTA TTAGTCAATG GTGGTAGTTT 6780 
GGATCTTCGT GAAGAAGATT TTACAGCATT AGAAGCTGCA AAAGACTTGA ATGATTTCCT 6840

AGGAATCAAC TACTATATGA GTGACTGGAT GGAAGCCTTT GATGGAGAAA CTGAAATTAT 6900

CCATAATGGT AAAGGTGAAA AAGGAAGCTC TAAGTATCAA ATCAAAGGTG TTGGTCGTCG 6960

TGTAGCTCCT GACTATGTAC CACGCACGGA TTGGCATTGG ATTATCTACC CTCAAGGTTT 7020

CTATGACCAA ATCATGCGTG TGAAGAAAGA TTATCCTAAC TACAAGAAGA TTTACATCAC 7080


TGAAAATGGT 


CTCGGCTATA 


AAGATGAGTT 


CGTTGATAAC 


ACTGTTTACG 


ATGATGGTCG 


7140 


TATTGATTAC 


GTGAAGCAAC 


ACTTGGAGGT 


TTTATCTGAT 


GCGATTGCAG 


ATGGAGCTAA 


7200 


TOTAAAAGGT 


TACTTCATTT 


GGTCATTAAT 


GGATGTCTTC 


TCATGGTCAA 


ACGGTTATGA 


7260 


GAAACGTTAT 


GGTCTCTTCT 


ACGTAGATTT 


TGAAACTCAA 


GAACGTTATC 


CTAAGAAATC 


7320 


AGCTCACTGG 


TACAAGAAAG 


TAGCGGAAAC 


TCAGATTATA 


GACTAGTAGA 


ATTAGTCATT 


7380 


AGATATAGAA 


TTTTAGTGAG 


TCAAAAAGAT GTTCAAAGAT 


TTTATCCAAT 


CTATTTATGA 


7440 


AAAAAAGTTT 


ATATTATAAA 


TTTCGAAAAA 


TGCTCTCAAA 


TACCGTGTTT 


GACGAGTGAA 


7500 


GAATTGAAAA 


GTCTTGGAAA 


ATGGTATGTC 


TCGACTGGTA 


AAGAATGGAT 


TTGTCATTCA 


7560 


GATGATGAGC 


TGGAAGAATT 


TAAAAATCTA 


TTTTTAAATT 


TTATCAATCC 


TGAAGAATGG 


7620 


GATACTATCT 


CCTTTGATTC 


AGATTTTATG 


CCGTTTCAAC 


AATCGTAACC 


AATTTCTCAA 


7680 


AAAAGTTAAA 


TCTTATATTT 


AGTACTCTGT 


AAAACTCTTA 


TCTAATCACG 


TTGCTTATAC 


7740 


TCAATGAAAA 


TCAAAGAGCA 


ACTTTAAACT 


AGGAAGCGAG 


TCGCAGATTT 


CTCAATGCAT 


7800 


AGCTTTGAGG 


AATTGGGCAA 


AAAGTCTTTG 


ATATAGAAAA 


ACGCATAGTA 


TCAGGTGTTT 


7860 


CAACACCTGA 


TACTATGCGT 


TTTATTGTGG 


GAAGATTTAC 


tttitttctt 


CTGAAATTGA 


7920 


GTTGTTACCC 


AGGCTCTTTC 


AGTTTATTAA 


GGCTTGATGA 


CTTTAATGTG 


TTTAGATAGC 


7980 


TTAAAAAGGA 


TTGAATCACT 


TAGTTTAGAA 


TCTGAAACAA 


TAGTATCAAG 


ATTTGATACA 


8040 


TTATAAAAAG 


TATAAAAATC 


AAACTTATTG 


AACTTGCTAT 


GATCTGCGAG 


TAAATATTTT 


8100 


TTATTAGAAT 


TATTTAAACC 


GATGCGTTGA 


GCCTCTCCCT 


CTTCCTCGCT 


jam ryv h fit i» 




AGAGCTCCGT TTTGAATACC ATTACAGCTA ACGAAAGCTT TAGAAAATTG GAGATTAGAG 8220

AGATTTTGTA GGGTCAATGT ACCAACAAAA GCACCTGTAA TATCGCGATA ATTTCCACCT 8280

ATTAAAATCA AATCTGTTAA TTTTCGTTCG CTTAAAATCA GAAAAACAGG TAGACTGTTG 8340

GTTACGACGC GGATATTGTC AATAGGCAAC TCACGCGCAA AAAACTCTAA TGTTGTTCCT 8400

GGTCCAATGA AAATAGTTTC TCTTTCTTCT ACTAGACTGC CTGCAAAATG GGCTATTTCT 8460

TGTTTTTCTG CCGTTTGGAG GGCTPGTTTT TCAATATTTG ATCGCTCATT AGTCAAAAGG 8520

GAGTTGGTTC GAAGTTTTTC AGCTCCACCA TGCACACGAA TCAGCAAATC TTTATCAGCT 8580
AATTCCTCTA AATAGCCCCT TGCACTCATA TCTCAAACGG CTATTTCCTC CATAATCTGT
8640
TTAACTGTTA T 8651

(2) INFORMATION FOR SEQ ID NO: 182:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3786 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:
AATCTCCAAT CAGTGCCACT TCAGCTACAA AGAAGAGGAG GATAATAACT CCGTTCACAA 60

GGACAGACAA GAATAATTGA TAGAAGGAGT CGGTTTCACT TGCTTGACTT GGTCTTGTAA 120

TGATwTGGAG ACTGGCAAGC AGAATGATTC CAATGCTAAT CACACACAAG AGGCCTGTAA 180

ATCGTAGGCT ATCAAAGAAA GCAAAGAAAC TAGCAATAGC AGTGAGGAinG ATTGGAATTG 240

CCAAGAGTTG ACTATATTGT TGGAGAACCT TGTCTAGCGT CCAGTCCTTT TCCTGGTGGA 300

TAAATCGTCT CACAACGAAA CTACCCAAGA GGAATGAAAA GAAGAAGAGT GTTGTCGCTA 360

CTAGGATAGA GATGATAGAA AAAAGAGTTA AAGGAGCTAG CTCCTCAGGG AAGCGACTGT 420

TAATGCTTGC TATATGTCCA TAGTAAGCAT GTTTGATGTG ATAGATACTA AAGAAAAAGG 480

AAGATGCAGA AAACAGAATG AGCAAGAGAA AGGCTGTGTA ACTGTGTGTG ATACTTGTTT 540

CCAACTTACT TGTAGGAGAT TTGATCGCTT CCACTAGCCA AGACCAAAAA TCAAGCACTT 600

GCTCTTTCCA TTTATCCCTA GATTTTGGAG CTTGGTCCGG GATATAAGGA CTTTCTAAAG 660

ATTTACTGAT AAGAAGTGGC TCTTTCGTGG TTGCTTTTTG CTGAGGAAGA GCTTCTTGGC 720

TCTCTTCAGC TATAGTGACT TTTTCTGTTT CTTTAGAAAG GTCTGGCTCT TCTTCAGTAG 780

AATTAGATGC CTTCTTTTCT TCTATTTCTG TTCTCGCTTC ACTGTCTTCA GCAGCTTCAA 840

TTTTCTCTTC TTGCTGGCTT TCCAATTCGA CTTCAGCTTG AGGGACTTCC TCCTCTAACT 900

GAGTATTTTT TTCAATTGGT GTATCGAGAT CGGCTATCGT TTCTTCAGCC TTGTCTGCAA 960

CCTCTTGAGC TTGCTCTTCA GGCTTGTTCT TGCTTGTTGT TTTTACAAAA TCATTACTTT 1020

CAAACCATTC TTGTTTCATG GTAGAACCTC CTTTTTAGTT AGATAAATAT GTTTCCATAG 1080

TAGCAAATGT AAGCGTTTTT GTCAACGTCT GCTTGGTGTG GATATTAGAT CAATATTATC 1140

ATCAGATCTC GCAATGAGTT GATCCTTGAC ATCGGTTTTT TCAGTTTTGT AAGGGTTGCT 1200

TAATTCCGTA CCTCTTGATT CAGGCTTTTC TCTTGTGAAT TGGAAGATAG AACCATAGTT 1260
GCTTGAGATG TCCCAGTTAA TTCGTTGGCT TTCTTTCTGG TCTAGGATGA TTCTGAGATA 1320


ATCTTTGGCA GTCAGTTCAA CCTTGCCATG GACTTGGATA TTTTCAGCGT GGAAGTGATT 1380

CTCTGTTGAC TCTAGCTGAC TATCTCTAAC AACTGTATCA AAGATATTAA CGATATTGGG 1440

CGTTGTGAGT TTACTGTTTT TGATACGACT TCCTTCAATT CGGAGGATAT AGCTGTTTGT 1500

ATTGAGGGTC GCATTTTCAA GGCTAGCATT TATGATGGTG GTTTGTCCGC GATTGGCTGA 1560

GATGTTGATC CCTTTTAGAG TTCTCCCTTT TGGTAGTCGG AGAATAACTT CTTCAAAACG 1620

ACTAGAGTAG CTACTTGCGA TATGAAGAAT CCCACCAATT CCAGAAGAGA GAAACGGAGT 1680

TTCAGACAGT TTCTTATCAG TGAGACTCAG AGTTCTATCG TTCTGATTGG TGATAAGATC 1740

ATGGTGAGCA GAAAGAGATG GATGGTAAGA AATGTGGATT TGATCATCGA AAGAGTCTGT 1800

GATGGTGAGC GTGTGTTGGT GGAGAGTAAT TTCTAGGTTT TCGACTTCCT TGCCAAAGGT 1860

TAGCTTTTCC GTACGGCTAT CATAGACAGG TTCTTTGGAC ATGGAAAGTA GGCTCTTAAC 1920

CCCGTCAGAT TGGATACCTA CAAAAAGCAG GATAAAGCCG ATAACGGTAG TCACCACACC 1980

AAAGATGAGA AATCCTTTTG TCCATTTACG CATGCTGATT ACCTCTCTTT CCTTTTTTAA 2040

GAACAAATTG TACCAGACGA ACAATGAGTA GACCGAAGAA GCGAGTTGCA TAGGAAATGC 2100

CAAGTAAAAC TAGCGAAGAA GCACCGATAG CCAGTAAACC AGAACCAAAA ATCAAGATAA 2160

AGGCTGATTT GGCTTGGGCG AGGACAGTGA AACTTTCAAC TAAAAATAGG AATCCGCCGA 2220

TGATACCCAG TATGGAAACT GCAAAGAAAG CCAGAATGAC AGTCAAAGCG GCTACAAGAA 2280

TTGCGAACAG GGTCACGAGG ATGGCGATTC CCAGAGGAAT GCCGATAGGT GCTGCAAGGA 2340

GGGCTAACAA GGCGATATGT AAAATTTGTC GGTTATTTTT TTGAGCGGGT GCTTCATTGA 2400

rmrn'ATC GAGAAGATTG GATAGAACTT CGTGGGCCGC TTCTTTGGGA GTTCCCAAAC 2460

TAGCGATGAG TTCTTCTTCT CCTTCGACTC CAGCATCGTC AAAGAGCTCT CTGAAATAGT 2520

CCATGGCTTC GATACGGTCA GCTTCAGGTA GTTTCTTGAG ATAGAGTTCT AGCTGAGTCA 2580


GGTATTCAGT 






CCTTCTATGA 


TGCCATTGAT 


GGTGTCTGTA 


2640 


TAGAGTGCCC ATTCATCTTT TAGGGTCAAG AGCTGCTCTA TACCACCGTT TGTCAAGGAG 2700

TAGTATTTGC GCATGCGACC TTGGAACTCT CTAGAATAGG TTGTCAGAAA GCTATTGCCT 2760

TCCAATTTTT TGAGAATGGG ATAGAGTGTG GATTCTTTGA TATTAGCGAT CAGCTTAATG 2820

GTTTGGCTAA TCTCATAACC ATAAGAATCA CCCTGCTCCA GTACAGCCAA GATGAGAAAT 2880

TCAATCAAGG CAGAGGATGT TGGAAAGTAC ATGGGAAACC TCCTTTTCTA ATGTGTAAGA 2940

TTTTTATATA TAATTTTTCT ACACATACAT TGTACATCTA AAAGAAAGCC CTGTCAAGAG 3000

AAATGTGTAA AATTTTTATA TATAAAAAAC TTCTAGCTAA AACTAGAAGT TTAAAGGATC 3060
TTATCCGCTC TCTCCACTGT AAAGAGGGCC ACAGTCATCA GGATATCGAT GAGCAAGAGG 3120

GCAGCTACAG ATGGTACCCA AGAGTGGAAC AGGTCAAAAC TGTAACCAAA GAGGGTTGGC 3180 

CCAAAGGCTG CTAGGATATA GCCTCCTCTT TGAGATAGGC CGGACAATTG GGCTGTCTTT 3240 

TCAGGGGCGC TTGTCTTGAG TGAAAAGTTG ACCATGAGAT AAGGGAAGAG GGCACTGGTT 3300 

GCGGTTCCGA TGAGGAGATG GATGGCAAGC CAGTAAATGA AATTATTGAT TGGGAAAAAG 3360 

AGCATGGAAA TGCCGACCAC ACCAGCTAGT GAAACCAGAG TGAGCATGAG CTGACGGTTG 3420 

CGAGTAGATA AACTGGTTGT CAGGCTTGGG ATGGTCATTG AAAAAGGAAT GCTAATCAGA 3480 

GATAAGATAG AAGTCAGCAA GCCAGCTTCG TGACTGGATA GACCTGCATG GATAGACATG 3540 

GTAGGTAACC AGGTCATGAC GGTGTAAAAG ATCAAGGATT GAAAACCTGA AAAGATAATA 3600 

ATTGCCCAAA CCTGTTTATT ACGCATGACC TTTATTTGAC TT TVr T GT rT GGTTTGTGGA 3660 

GCTAGTCTAT GATTATAGCG GTGATTTGGG AGCCAGACCA AAAAAGTTGC TAGACAGAGT 3720 

AACGTGAGGA GAAGGATAAG TCCTTTCCAA GAACTGGCTT GTGTAATGGG CACAGCTAGA 3780 

TAGGAA 3786
(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3054 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

TCAGCTAAAA AACATTGCTA AATTGATTGA AGCTGGTGCT ACACATTCCG ATTCAACTTC 60 

TCACACGGCG ACCACCAAGA ACAAGGTGAG CGTATGGCAA CTGTTAAACT TGCGGAAAAA 120 

ATTGCAGGTA AAAAAGTTGG TTTCCTTCTT GATACAAAAG GACCTGAAAT CCGTACAGAA 180

TTGTTCGAAG GTGAAGCTAA AGAATATTCA TACAAAACTG GTGAAAAAAT TCGTGTTGCA 240 

ACTAAACAAG GAATCAAATC AACTCGTGAA GTGATTGCGT TGAACGTTGC TGGTGCTCTT 300 

GATATCTATC ATGATGTTGA AGTTGGTCGT CAAGTTTTGG TTGACGATGG TAAACTTGGT 360 

CTTCGTGTGG TTGCTAAAGA TGATGCAACT CGTGAATTTG AAGTTGAAGT TGAAAACGAT 420 

GGTATCATOG CTAAACAAAA AGGTGTGAAC ATCCCTAACA CTAAAATTCC TTTCCCAGCT 480 

CTTGCTGAAC GCGATAACGA CGATATCCGT TTCGGTCTTG AACAAGGTAT CAACTTCATC 540 

GCAATTTCAT TCGTACGTAC TGCAAAAGAT GTGAACGAAG TTCGTGCAAT CTGTGAAGAA 600 
ACTGGAAACG GACATGTTCA ATTGTTCGCT AAAATCGAAA ACCAACAAGG TATCGATAAC 660


TTAGATGAAA TCATCGAAGC AGCTGATGGT ATTATGATTG CTCGTGGTGA TATGGGTATC 720

GAAGTACCGT TCGAAATGGT TCCAGTTTAT CAAAAAATGA TTATCAAGAA AGTCAATGCT 780

GCAGGTAAAG TTGTTATCAC TGCAACAAAC ATGCTTGAAA CAATGACTGA AAAACCACGT 840

GCAACTCCTT CAGAAGTATC AGATGTATTC AACGCTGTTA TCGACGGAAC TGACGCTACA 900

ATGTTGTCAG GCGAGTCTGC AAACGGTAAA TACCCACTCG AGTCAGTAAC TACAATGGCT 960

ACAATCGACA AGAACGCTCA AGCTCTTCTT AATGAATACG GACCTCTTGA TTCAGATTCA 1020

TTTGAGCGTA ACTCTAAGAC AGAAGTAATG GCTTCTGCTG TTAAAGATGC TACTAGCTCA 1080

ATGGATATCA AATTGGTTGT AACTCTTACT AAGACAGGTC ATACTGCACG TTTGATTTCT 1140

AAATACCGTC CAAATGCTGA CATCTTAGCA TTGACATTTG ACGAATTGAC AGAACCTGGC 1200

TTGATGTTGA ACTGGGGTGT TATCCCAATG TTGACAGATG CTCCATCTTC AACTGACGAT 1260

ATGTTCGAAA TCGCTGAACG TAAAGCGGTA GAAGCAGGTC TCGTTGAGTC AGGCGATGAT 1320

ATCGTTATCG TTGCTGGTGT GCCAGTAGGA GAAGCTGTTC GCACAAACAC AATCCGTATC 1380

CGCACAGTAC GTTAAGAAAA ATATAAAAAC CTATCATATC CAGCTTTAGA GCTTGTGTGA 1440

TAGGCTTTTT GTATAGAGGG TAAGAAATAG GCAAAACTTT CATAATGGAT TGATACTCTT 1500

CGAAAATCTC TTCAAACCAC GTCAGCGTCG CCTTACCGTA TATATGTTAC TgACTTCGTC 1560

AGTTCTATCT ACAACCTCAA AGCAGTGCTT TGAGCAACtG CGGCTAGCTP CCTAGTTTGC 1620

TCTTTGATTT TCATTGAGTA TGAAATAAGA TATGCACAAA TTGATTAGAA AGTCAAATGA 1680

ATTTCTACAA ATGTTTTAGC AATCGTAATG TACTTGTCTA GATTCGATCT GATATATTTT 1740

CGATTTAATG ATATGGTATT TAAAACCTCC AAAGTAGCTT ACTCCATTCT TTTACTTACG 1800

TGAGTGTAGA TGTTATTTAC TGTTTTAGCG TTTTTGTGTT CCACTCTAAC CATTATAGCA 1860

TTCTTCTCAG CTAGTGTACT AAGGAGTGTG TGCCTGAAAA TATGGGAACT AAGGGGCTGG 1920


TTTATCGGTT 


TCTCTAGTTT 


AGTA J r'i J l\iCC 


TTTTGCAAAG 


TGATCTTAAA 


TGCCTTTCTC 


1980 


TAAATTTACA TATCACTATT GTTTAACAAA ATCTAATCTA TTTTAGGTCA CTTATTCTTT 2040

TTTTGAAATG TAGAATGAAC TTTTTCAAAG TTTTTCGAAT CTTTTAAAAT CTGTTTGCTT 2100

TATATCGCCA TTCTCCCCCC TTTTTTAATT CTCCCTATAT AGCCTGACAG CTTTCCCGAT 2160

GGTACGAATA TGGTTGCTTT CGTCTAGGTG GATGTCGGGG TATTCGGGAT TGAGTTrHT 2220

TGAGGCAGCC TTGGCGGAGT TTCTTGACAT AGTTAGTGCC GTCTACTTGG AAGATGCCGA 2280

TGGTATTATA GTCAATCTGT GGGGTATTCT TGATAAATAG GTAGTCGCTG TTTCTTATCT 2340

TTGGCTCCAT GGACTTGCTG ACGACATAAG CGATTGGGTC GTAGTCGTCT GGGATAATGG 2400
AAACTCCATA TCTAAATCGT TGTCCTGCAT CGAGCGGCTA CCTGCAGAGA TAAACTACCT 2460

AACACGAGAG TAAGTAGTCT GTCTGTAGTC GTCCAGTCTG ATGATTTTTA CGATACTTCG 2520 

TTTTTCTGAT CATACAGTTG CCTCTCGGCA TAGGTCAGAA CTTTACCTTG TCTGGGTGGT 2580

TCCCGTTGGT CGTAGATAGA TTGGATATCG CTAGGAGAAT CCTTTTGAAC TGGAGGAAAG 2640 

AGGGCATCGA TCAAGCTACT GAATACTTTA ACTAAGTCAA ATATAGTATT TTTCTTAGTA 2700 

GACCTAACCC TTTTTTCATA ATTTCTAATG GTGTTTTTAC TTATACCTAT CTTAGTACCC 2760 

AATTCTTATT GAGTCCAACC ATTACTAGTC TATATTGTTT TATAGTTGAT TGAGTTTGGA 2820 

ATAGTACGCT GTAGCTGCTA AAACATTTCT AGAAATTAAT TTGACTTTCC TAATAGAGTT 2880 

GTTCATATCT TATTTCAATC TATTATGTTT TTCACCTCTA ACAATCGCAA TCTCTTCTTT 2940 

ATCCATGAAT GAAATCGCTT TCTATTTTTG TAAGTAAAGC ATAACACGAA ATCCACGAAA 3000 

ATGAAAACCT TTGTTGTGTT TTCGTAAAAA ATTTGTTGAC AGAGCACGAA ACGC 3054 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1590 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

TGTGATTTTC yCAAAATTTG GTAAAATATA TCTTAATCAT TTTCAGGAGG ACAAAAATTT 60 

GACAAGATAT CAGAATTTAG TAAATGGAAA ATGGAAATCA TCTGAACAAG AAATTACGAT 120 

TTATTCACCA ATCAATCAAG AAGAATTGGG TACAGTTCCA GCCATGACTC AGACTGAAGC 180 

TGATGAGGCT ATGCAAGCTG CGCGTGCAGC CCTCCCAGCA TGGCGAGCTT TATCAGCAGT 240 

TGAACGTGCG GCTTATTTGC ATAAAACAGC AGCTATTTTA GAACGCGATA AGGAAGAAAT 300

TCGTACTATC CTTGCCAAAG AAGTAGCAAA AGGGATTAAA GCAGCAATTG GAGAAGTAGT 360 

GCGTACAGCA GACTTGATTC GTTATGCTGC TGAGGAAGGT CTCCCTATCA CTGGACAAGC 420 

AATGGAAGGT GGTGGTTTTG AGGCAACAAG TAAAAACAAA CTGGCTGTTG TCCGTCGTGA 480 

ACCAGTTGGT ATCGTGCTAG CGATTGCTCC CTTTAATTAT CCAGTTAATT TATCTGCTTC 540 

TAAAATTGCA CCTGCCTTGA TTCCAGGGAA TGTGGTCATG TTTAACCCAC CAACACAAGG 600 

TTCCATTTCT GGACTCTTGT TGGCTAAAGC ATTTGAAGAA GCAGGGATTC CGGCAGGTGT 660 

TTTCAACACC ATTACAGGTC GTGGTTCAGA AATTGGGGAT TATATCATTG AGCACAAAGA 720 
AGTCAACTTC ATCAACTTTA CAGGTTCAAC TCCTATTGGA GAACGTATTG GTCGTTTAGC 780

TGGTATGCGT CCTATCATGT TGGAACTTGG TGGGAAAGAT GCAGCTCTTG TACTAGAAGA 840 

TGCAGATTTG GAACATGCTG CCAAGCAAAT TGTTGCGGGA GCCTTTAGCT ACTCAGGACA 900 

ACGTTGCACG GCCATTAAAC GTGTCATTGT TCTCGAAAGT GTAGCAGATA AATTAGCTAC 960 

TTTGCTTCAG GAAGAAGTTT CTAAATTAAC AGTTGGTGAT CCATTTGACA ATCCTGATAT 1020 

TACACCTGTT ATTGACAATG CTTCAGCCGA CTTCATTTGG GGCTTGATTG AGGATGCACA 1080 

AGAAAAAGAA GCTCAGGCTC TTACACCAAT CAAACGTGAG GGCAATCTTC TCTGGCCAGT 1140 

GCTTTTTGAC CAAGTTACAA AAGATATGAA AGTGGCATGG GAAGAGCCAT TTGGTCCTGT 1200 

TTTACCAATC ATTCGTGTGG CTAGTGTAGA GGAAGCTATT GCCTTTGCCA ACGAATCTGA 1260 

ATTCGGCCTT CAATCATCAG TCTTTACAAA TGATTTCAAA AAAGCCTTTG AAATTGCTGA 1320 

AAAACTTGAA GTAGGTACAG TCCACATTAA TAATAAAACC CAGCGTGGTC CAGATAATTT 1380 

CCCATTCCTT GGTGTCAAAG GTTCTGGAGC TGGAGTGCAA GGAATTAAAT ATACCATTGA 1440 

AGCGATGACA AATGTCAAAT CCATTGTTTT TGATGTGAAA TAACGTGTAA AACCAGGAAA 1500

TTGTTTTCCT GGTTTTATTT TTTTGCTATA AAATAATAAT AATTATAGAA AAAATACGAA 1560 

CTTTTTGGTA TTATAATAGA TTGAAACCGG 1590 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4848 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CCTGCAGTTG TCAGACCTGT AATTTTCTTT TTATCTGTAA TAAGAATCGT TCCAGCGCCT 60 

AGAAAACCCA CACCTGATAT AACTTGAGCT CCTAATCGTG TAGGATCTCC TGTCCCAAAT 120 

TTATAAGATA CGTATTCATT CGTCATCATA ATCAAACATG CAGCTAGACA AACAATACTA 180 

TAAGTTCGGA TGCCTGCAGG CTGGGATTTG CTCCCTCTCT CTAAACCAAT TATACTACCA 240 

ATGACTACTG ATAAAACAAT CCTGACAACT ATTTCAATAT TTGATAACCC AAGACTAGTG 300 

GCTGTCATGA TTATTTCCTT ACTTTACGCC CCGGTCTTTG TGTGAAGTAT AATACCGTTC 360 

CAGAAATAAT CATCAGAACA ATTGTATAAA CAAATACCAG ACCTTGTGCA TTAGATGTTG 420 

CTGTTTCATC ACCTGCAGAT CGAATCGTAA TACCTAATGG TTGAGCTAGG GGATGGTAAA 480 

GGAATACAGA TAAGTCGAAG TCAGTTAATA AAGAGTTAAA GTTTAAAGCA ATAACAGAGA 540 
GAACAACCGG TAAAATAAAT GGAATGATAA CCTTCATCAT AGTATAAAAA GGTGAACCAC 600

CCATACTTCT TGCTGCATCT TCCATCTCAT CATCAACACT AAATAAAATA GCACGTACCA 660

TTCTATAAGA AAATGGGATT TTTACAACTA TATATGCAAT AAGTAGAATT ACCAAACTAC 720 

CTACCAAAAT CTGATTCAAG ACAAGAAATT GTGGCTGATT AAAAGTAAAT AATAAACTTA 780 

CTGCTAAAAG TGTACTTGGT AGTAACCAAG GAAGTAGAGC ACCATATTCA AATAAGAAAT 840 

CAAAACGAGA TTTATGTTTT CTGACAACAC GAGCAAATAC AACTGCGAGA ATTGTTGCTG 900 

TTGTCCCAGC AATAATAGAA TAAATAAAGC TGACCAAGAA TGGAGAGAAT GCCGCACTAT 960 

TACTAAAGAA TAAGCGATAA TTTTCTAAAG TAAAGTTTGA TAATGTTAAG TTACCTGTTT 1020 

GAATTGCAAC TGGATCTGTA AATGAGTATA ATACTATAAA AATTAGTGGA AGCATGAAAA 1080 

CTGTGAACAA TCCATATGCT ACAATGTGAG CAATGATATT CCAAGGCTTA GACGCAATTT 1140 

TTTGTTTTTT AAGAGGCGCT TTAGTCTTAG AGATAGAAAT ATAATTTCCA CCTTTTTCTA 1200 

TCTTATTCAT GATAGTAAGC AAAATTGTAG TTGCAATACC TAAAATAATT GCAAGTAGGG 1260 

CAGCTAAATC ACGAGAATTC CCCATCCCTG CAAATGTAAT AATCATTGGA TTTATAGTTT 1320 

GAAATTCTTT ACCACCAACA ATCATGGGTG CTGCTACTGC AGATAAACCA CTAAGAAAAA 1380 

CCATAATAGT AAGTGCAAAT AGAGTTGGAA TTAAGGTTGG TAACACTACT TTTCGGAAAA 1440 

CAGTAAATGG TTTTGCTCCC ATATTTCGAG CAGCCTCAAT AGTGTGATAG TCAACGCTTC 1500 

GAATTGTATT TGTTAAAAAC AATGTATGAT TAGCAGTTCC TGAAAATGTC ATAATGAATA 1560 

AGACTGCACC ATACCCAATA AACCAGTTAG GGTCTAAAGA AGGGATAACA TTTTGTAAAA 1620 

ATTTTGTAAT CAATCCATAA GGACCATAGA CAAATTTATA TCCAGTCGCT AAAACCACTC 1680 

CTCCATAAAT TAAAGAGGTC ATATAACCTA ATTTTAAAAT TTTAGCACCT TTAATATCAA 1740 

AGTACTCTGT AAATAGAACA CAAAGAATAC CTACGACATT AACTGTAATA ATGAGTGAAA 1800 

ATGCTAACTT AAAACTGTTC ATAATACTCT GAAGTGCCCT CTGAGATTTT AGAACACGAT 1860 

GTACAGCATC AAGGGAAAAT TCTCCTCCTT TTACAAATAC ATTCACTACT AGATCAAAGT 1920 

TTGGATAAAT AATAAATGTT ACTAAGAACC AGATTAACCC TAAACGAATA AGCCAATCTT 1980 

TTAAATTTAA TTTATGACGC ATACTGCACC TCCTTAAAAT TGCAGAACGT CTGATGGTGT 2040 

GATAAATAAT TCCACACTTT CTCCGACAGA TCTAATAGCA GCCTGACTAT CAATACTTGT 2100 

TACATTAAGA ATCTGACTTT CAGAAACTTT TATTGTATAG TGAATTGTAA CTCCAGAAAA 2160 

CTCAACATCA ATAATTGTCC CTTTTAGAAT AAAATCTTGT TCAGTTTCAC GATTGAATCG 2220 

AACTTTCTCT AATCGAATGT ATCCTTTTTT ATCCTCTAAG AAAACGCTTG TATTTTTCAA 2280 
TAATACTTCG TGGACTGTTT CATCGGTCAA AACATTAATA TCTCCAATAA AATCACATAC 2340


AAATTCAGTT TGAGAATTAT GATAAATCTC TACTGGTGTA CCCACCTGTT CGATGTATCC 2400

ATTGTTAAAG ACTGCAATTC TATCAGATAA AGTCAAGGCT TCCTCTTGAT CATGAGTAAC 2460

ATATAAAGTA GTAATACCTA ACTCTTTTTG AAGTCTTTTC AACTCTTTTC TCAAATCTAC 2520

ACGTAATTTT GCGTCAAGGT TTGACAATGG TTCATCTAGA CAftftGAATTT TAGGTTCAAC 2580

AACCAGAGCA CGAGCCAATG CTACCCTTTG TTGTTGACCC CCAGATAATT CTGATACATT 2640

ACGCTGTAAC TGTTGATCAG AGATCTTAAT TTTTGCTGCC ACTGCTGATA CTTTAGCTTT 2700

AATAACATCT GGAGCTACCT TCTTAACTTT TAAACCAAAT GCAATATTAT CAAAAACAGT 2760

CATAGTTGGA AATAGCGCAT AAGATTGAAA TACAATACCA ATTCCACGCT TTTCAGGTTC 2820

CAAATGACTG ACATCTGTTC CATTAACTTC AATACTTCCT GATGATGGAT CTAGAAAACC 2880

TACCAATGCT CTCAAAGTAG TTGATTTACC ACATCCTGAA GGCCCAAGAA ATGTAAAAAA 2940

TTCCCCTTCA TGTATATCTA AATTCAGATT ATCAATTGCA ACAAAATCAC CATATTTAAT 3000

TTGAATATTA TCAAATTTAA TCATCTCACT AACTCCCTCT ATTACTAAAC CAAAAGCCTC 3060

TCTTTATTTC TTCCATAAAT TTAGAAATAA TAGAGAGACT TGGACATAAA AATTAACTCT 3120

TATTTCTTAT TGTACGTATT CTAATTCAGC TTTTTCTACC CATTCATCCA AATGCTTTCC 3180

AACAGCTTCC CAGTCAATAT TTTGTGGTTT CACTTGATCA ACAAATTTCT TCGTATCTTC 3240

aggtagAtct TTGAGGGCAT CTTTATTTGC AGGAATAGAT CCAAAGTTCT TACTATATTC 3300

TACTTGAATT TCTGATTGAC CAAACCAATC AATAAATTCT TTAGCTAACG CTTGTTTTTT 3360

ACTAGTGCTT AAAACCATAG TTTGTTCAGT TACAAATGGT ACACCAATCT CAGGAGTCAT 3420

AACTTTGAAA ACAACATTTT GTTCTTTTTG TCCAACTAAT GCACCAGAAC CCCACATCAT 3480

TCCATATTGT ATTGGATCTT CTTTGTCTAA CATCTTAACA ATTGAACTTT CTCCCTTTTG 3540

AAGAGTGTAT GCATTTTTCA AATATTCTTT TGCTACTTCC CAACCTITTT CGGAAACACC 3600

TAATTCACCT TTATCATCAA GGTATCGAAC TAAGATACTT GCTAGAATTG CCCGTCCTGT 3660

ACCTCCTTGA AGACCAGAAA TTGAATATTT ACCTTTATAC TTACTACCTA ATTCAGTCCA 3720

ATCTTTAGGC ATTTCTTTTA CATCAGGCGC CCCAATTAAA ACTAATGGTT GAACAATCAC 3780

AGGATTATAA TAATTATCTT TATCTGATAA AGATTGATCA ATTTTATCTA ACCATTTAGG 3840

CTTGTACTGT ACTAGTAATT TTTGATCTCT AATTTTATTT GAATCAACAG CACCAATTCC 3900

AAATACCATA TCTGCAACTG CATTATTCTT CTCAGCAATA ACACGGTCTG CTAATTGAGC 3960

GCCAGCGATA TCAACCATTT TTATATTAAA ACCAGCTTCT TTTGCTTTAG CAGTTAACCA 4020

ATCACCACGA CCATTTGAGA CTGAGTTCGA ATAGATAACT AATTCTTGAC TTTTATCAGC 4080
TTTTTCTTCA GATGAAGAAG CAGTCGTAGA ATTTGAACCT CCAGAGCAAG CAGCAAGTGT 4140


AGTAAgAGCA ACTCCCGTTG CAAGTACAGT AGACCAAACT TTCATTTTTT TCATGATAAG 4200


4 A^&w^l X & A 


1 1A1 1 S\ i 111 


ATTTAAATTT 


TTCGTGATAT 


GGAACAAATT 


GTCTCATATC 


4260 


TTCAAATACA GTATAGTCAA TACGGTTTAC ACTAATAGTT GGAATCTTCT CTAATAAAAT 4320

TTCAGTTAAT TCTGCTCTGA CTTTAGTAAA CTCTTCTTCC TCCTCTTCGG TTAGAGGAAT 4380

CCGAAGATAC CCAATTGAAA TATGGAATTG ATATCTATCA TGATTAGGGA AACAAACACC 4440

TGCTTTTTCT GAGACATAAG TACGAATTTC TTCTAATCTC TTTGCAGAAG CTTCATCTGC 4500

AGGTTCAACT AGTATGTTTT GTTTTCCCAT TTCAGTTATA CGCATATGAA TTTCTTCATC 4560

CAACAATGGA AAAATTTCAA GTTGTTTAGC AAAGTAATCA TGTATTTCCT GTAAAGGTGT 4620

ATCTAGAGGA AGATTACTGC TCCAAAACTC gtTCACGATT TTCATGGCAC AACAATTCAA 4680

TTACAGTCAT GTGAATAGAA TTCCTTGGAG TTAAAGTAAA CTTATCGATA AATGGTAATT 4740

CTCTATAACG TGATTGAATA ATATCAACAA CTTCCATCAA ATCTTGTTTA GTATAAAGAT 4800

TTCCTACAAC TGTATTCCCA GGGAAATGAT TAAATTCCCC ATTCTCGG 4848



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3763 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

GTTATAAGCA ACACCTTCTT GCTTGCCATA AGTTGTGAAA TGGGTAGAAT CGATATCTAC 60 

AATGAGTTGG TTTAGCTGGT GAAACTGTAA AAAGAATTCG ACCAATTCAA GGTTGAGGCA 120

TCGCAAACTA TGGACTGTTT CCTCGTCAGT TCTGGAAAGA AAACGGGATA AGGTTGGCTG 180 

TGAAGCAAGC TGCCCTCCTT CCAATAATTT TGGAAAGTAG GCATCAGCTG ACAATTCTTT 240 

ACAAGCATAG TCCGTTCCAT AACCTGTTAA CAGTTGAAAG AGGAACTGGA CAAGGATATC 300 

TGAATCCGAA TAACGACAGT AGCGGCGTTG GTCATTCGTT ACTAAATACT TAGAAATCCG 360 

CTCTTTTAGT TTCAACTGGG AAAAAAGTTC CTGAAAAAAG ATAAGACCAC CATACTGGGT 420 

TAAATGACCT CCATCGAAAG ATAGTTGGTA AAAAGACTTG TTTTGGAAGT GATGATTTGG 480 

TAAACTGTTC ATGTGAGTTT CCTTTCTTTT TCTGTTTTTT TCTACACTTA TACCATAAAG 540

GGGAAACTCT TTTTTGTCTA GTAAAAAACA CCCATTGGGT GAAAAAAGAA ACCATCCAGG 600 
ATCTAAGCTA AGGCAAGGAT TCTGGATGGT TTTTAGATTT GGGGTGAATA ATTGGGGATT
660

TAGGAGAAAT GATGGTATCT TCCAAATCAA AATCAACTTC ACTCCATAGT CTCAACTGAT 720

TGATTTTCCC ATCTTGATAG GTCACATCCT TGTCAAGGAT AAACTGAGTC AACACCTCAT 780

GTTGACCTTG ACACCTGATG TCATCTACCA AGAGCCAGAC ATCCTCTACC AACATGAGGA 840

TTTTTCTCCT GTGAAGATAA GGCAAATCAG GTTCTGCTGA CCAATAAGCC CCCTCAATAT 900

AATGCACTCC CTCCCTTTCT TTATGGTGAC AAAACAGGGA GTGAGGATAG TATTCATATT 960

CCCAGGATCC CGTGATTCTT TCCGGAGCTT TCCCATCTAC AATGCAGGTC GAATGACTCC 1020

AAGCACTCTT TAAGAGATAA CGTTCATATA TCTCCCGATA AGAATAACGC CCAGCATCTA 1080

TGAAAATAGG TTGGCCTTGA TACTGTAAGC AAAAACTATT CTCGTCACTA TGACTATGGG 1140

CACTTCCTAG CGGACCATTT TTGAAAAATA GATAACGATG TTCATCCTTA ATGCAGACAT 1200

GTCCAGAGTC TTCAAAGATC ATGGACTTAG GCTGCCAAGC TCTCTTTTCA AATTCCTGCA 1260

GTCGCTTGAC CTTTTCTCGC CCCAGGAACA AGAGGCTAAG CAAATCAACT TTAACATCCA 1320

GACCGTTAAG AAGGTCTTCC TGGTTCAAAA CCACAGCAGA CAGGCTCAAA ATTTCTGTCG 1380

TTTCTGTAGA ATCGCTATCA CCAAAAGCCA AAGTCCGTCC ATCTAAGCCT GTCATCATTT 1440

GAATATAGGT CGCCATCTTT TCCAGCAACT CTTGGTAACT ATCTTGCAAG TCTGGAAGCA 1500

AGAGACACAA ATCCAGCAAG GCTTTATAAA CCTCTACATG ATAGAGAATC GACTGTTCAA 1560

ACTGGCTTCC ATCTCCTAAA ATCTGTGTCT CAATTTGCTG TTTCAACTCC TCTGAAGCAA 1620

AATGGTAAGC TTCTTCTAGA TCCATCTTAT CTGAAAAGAA ATGATAGATA GCAAGCATCG 1680

GAATTGTTTG TAAAATCCCC CAGTTACTAA GGGTGTACTT GGCGCGATAG TAGCTTTTCA 1740

TAAAGTCAAT CTGCTTTTCT AGACTGACCA AAATTTTCTC TAGTTCTTTC TCCTCTAGCA 1800

AGTCAAATTT CAAGAGGAGC AAGACTAGTT TCAACCAAGT AAAGGAACGA ATACCCGTAT 1860

CCAACGTTCT AGTCATCAAG GATTGAGGAG AAAATTCTCT CACCTGCTCA ATCCAATCAA 1920

ATAGAAAGAA CTTGCACTTT TGAATATAGT CCTTATCTCC TTCTACCAGA TACCCTATCA 1980

TAAACTGCAA GAGATATTCT TGTCGATTGA GCATATAAGA CCATTCTGGA TCATCTTCAA 2040

ATACTTGATC CCATACCATC GGCTGGATTT GATGGATTTT TGAACAAGGC TCCATATCCC 2100

AAGGACTATC AAACATAAAA CGATTGTCCA TCAAGCGTTC AAGGGAACTC TTGACTTTCT 2160

CATAGTCTTT TGAACAGTGC GACAAGATAT AATCACGACA TTGATTTCCA TCGACTCTTT 2220

CAAAAAATTG TCTTCTTTCT TCTTTCATTA TCTATTACCA GAAAAAGAAC TACTTAAAAA 2280

GCAGTTCTTT TGTCTTTCCC ATTACACTTT CCTTTTCTAC ATGGATGACC ACACCTTTTG 2340

CAATCTGCAA GGAGACCAAG TCATCTTGGA TAGAAATGAT TTTTCCATGA ATTCCAGACA 2400
ATAACAAPAC


1 1 \-f\ I l^Al_ Un 


AATGTTAAAG 


AAGCTAAATA 


CTCTTCTCGT 


TGCTCCATCT 


2460 




uytu ill i 


I\»AL bAA 1 AO 


AATGAAAGCT 


TGACAGTAAA 


k. ^^^^^^^^ * ^^^^^^m. a. 

AGGGGACTCA 


2520 


CTGPCAAfiAP 


nnl\.ALlnl 1 


ft. ft A. A. A XpR 

A I AAAAUi 


ATGTTGTATC 


CATTAAGCTA 


TAATCTTAAG 


2580 


PPAfifTTPfV: 


ntAAl ILlbn 


rviATAACTGT 


TAAAATAACG 


AGTTTATATC 


TTGTCCATTT 


2640 


P^P'P^^'P^*!* 
Vi t 1 1L1 1 1 V* 


A I U AAliTAGT 


AaACTaAAAG 


TGTAAATAGG 


GCTGGTAGAA 


GAGCTGGAGC 


2700 


AACCTTATCA AGCATTCCCT GAATACTTAC GATACTTTGT TTAGCGTCTG CTTTAACTTC 2760

CCCTGCAGCA AAGGTAATCG GCACCATAAT CTTAACAGAT GTCGCTGCCA AACCAGCAAT 2820

TACCtTACAC CGATAATATT GGCAATACGA GAAATCGTTG CCATCTGTTC GCTTAGTTTA 2880

TCAATCACAG TTGTTCCTAG TTTGTATCCA TACAGACCAG TTGACAATTT AATCGCTGTT 2940

AAAATCGTAT TCATCGCAAG GAAGAACAAG ATTGGACCGA CAACCAAGCC TTCTTGAGCA 3000

AACGAAGCTG CGATGGTTGA GAACAATGGA GCTAAACAGA ATTGAGAAAG AGAATCCCCA 3060

ATACCTGCCA ATGGTCCCAT CAAGGCCATC TTGATGCTAC GTGTTTCTTT TGCCGGACGG 3120

CCATTTTCCA ACATTACAAG ATGCAAGCTG GTAATAAAAG GCAGGAAGTG TGGGTTGGTA 3180

TTATAGAATT CACAGTTTTC TTCCAAGGCT TGGTAGAAAC CTTCCTGATC CTCTCCATAG 3240

TGTTTTTTCA AAGCAGGATA CATCACATTG GCATATCCCA ACCCTTGATA GTTACTATAG 3300

TTAAATCCAT TTTGACAAAA GAATGCCCGC AAAGACGTTT TAAGATAATC ACGTTTTGTT 3360

AATTTGTTAG ATCCAGTCAT CGTGTGCTTC CTCCTCTACC ACATGATCCG CTGTTTTTGG 3420

CTTGTTATAA AATTCAATCA AAGCAAAGAT AGTACCTACA ATTGCAATAC CAATTGTTGC 3480

GATGTTTAGA TAAGCTGCAC AAACATATCC CAACAAGACA AAGGGAATCA ACTCTTTCTT 3540

AGCCATCACT GACAAGATCA TCGCAAAACC GATAGCTGGG AGCATTTTAC CAGCAACTGT 3600

CAAACCTGTA AGTAATACCG GTGGAATGTA GTCTACGAGT TTCAACAAGG TATCCATTGA 3660

AAGGGCACCA AGCAACCCAA GGTAAATCCA ATAAAGGCAA ACAACCAAAT TGTTGCATTT 3720

AGAGTGAACT TAAATTTCTT CAAATTATGG TTTTTCAAGT GCT 3763


(2) INFORMATION FOR SEQ ID NO: 187:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5053 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:
CAATCTCTGA GTATGTGCGG TCAATACTAw CAAAGGGAAT yCCTGACGTC AAGTAATGTT 60


CAATTGGmCT ATAGGTAATG GCAACCACTC CATCAACTTT ATTATGACGC AACATCTCCA 120

GATAGTCTTG CTCTCTATTT GTACCATTGA TAGAACATAA GAGTAATTTG TTATTTCTCT 180

TATAGACTTC ATTTTCCACA TGCATAGCAA ATTCTGAAAA GAAGGGATGC CAGATACTTG 240

GTACAATGAT TGCAATCGTT TCTGTTCGAT TTTTTTTCAT TCCTCTAGCG TAGTAATCTG 300

GAATGTAATT CAAAGTTTTA ATCGCTTGTT ccactttttt CAAAGTTACT TCTTTAATGC 360

CTTTTTCTTT ATTAATTACA CGTGAAACAG TTCCAACACT AACTCCTGCT TCTAAAGCAA 420

CATCTTTCAT GGTAATTGAT TTTCTTTGTT CTACCATATT ATCACCTCCT TTCAATATAT 480

AGTATCATGC AAATGCTTTT TAAGCAACTA TTTCTCAATC ATTTTTGGCC AGATCATTTA 540

TCCCATCATG AATAAAATCA CTCCAATTAG CTTTTGAAAA TACTTCAATT TTCATGTGTA 600

AACATCTACA TAAAACAGGA AAAGCCTTGG TTTCATGGCT TTTTTCGTAT CTTCTATAAA 660

AAAAGCAAGA GTTTTAGATG GCTATAAATC TAGATGTACA TTTTGCTTAA ATGATTGAAG 720

GTCTTTTCTT AACAAAAACA CCCCCAAAAT TAGACTTTTT CTGTCTAACT TTTGAGGTAC 780

AGTTCAAACG CGAAATAGCG TTrrTTTGTT ATTTTTGGTT ACTCATCTAA TCGAATAAAC 840

ATCATGGCAT TTAACAAGTA TATGAGTGAG ACCGTGTTTA TATTATTTGA ATAGATGAGT 900

CTCTTATTTT CAATAGGAGG AATAATAAAA TTAGAAATAA TGATATCATA AGGTGAATCT 960

TCTAAAGATT CCTTTGATAA TTCTAATTCA GTCCAAACTT CCAGTTCAAA ATTATTGCTA 1020

CAATAATAAG AAAGTGTCTC TGCAACGAAT TTTGCATGAT ACTGATCAAA ATTACTCATA 1080

ACTAAAACCT TTAGTTTAGG CTGATTTTGT AGCAAATTAA TCACCAAATG TTTGGTATGA 1140

GTGATGAAGG TATAAGATAG ATGATTTACC ATCATTGAAC TAGAACAAAC CTCAAGAGTC 1200

TCTAAATAGT GAGAAAGCTC TTrrrrrATA TCTGAAACAA ATTTTGGAAA AATATTTTGA 1260

AAGTTCCTGA TTGTATTCCC TTTTTGATCA AATAAAATAA ACTCAGTAAA CAACTCTTGA 1320

CGATACAGAT GTGCGGTATT ATGCAGATGC CAAATCAGAT TATCCTTATT CTCCATTTCn 1380

ATCTGATACT TGACTGAAAT CTGATCAATA AAATCACTCA ATAGATGGTA AGATTTTTCA 1440

ACATAACTAT ccttttttac GCATTTCATA AAGAGACTTT CATCTATGAA AAACATTTTT 1500

TGAAAGTAAG ACACAAATAA TTGGCAAACA ACTTCTTCAT CTAAAGAGAT ATTGTATTCT 1560

GATTCAAAAC TCTGAGCAAC ACCTTCTATT CCTTCTGCCT GCATTAAAAA ATCCAAACTT 1620

TGGTCGTTAA AAGAATCTTT ATCTACTTCC ATAAAATGAC CAAACTTTAT TCTATATAGG 1680

TTCGTAACTA GGAGCAACTT TAGCATTCTA TGCGTTGACA AATTCATTGG AAAGCTTGTT 1740

TCCTTATAAA CCAATTCTAA CAATTGAGAT AGTGGCTCTG ATGAAAAATT TTCAAATGGC 1800
WAX lv,t /UiUA


AATAATATTT 


TTCTGAAAAA 


TATTGTGCAA 


AAAAGTAACG 


AATGTCTCTC 


1860




TGATTTGAAC 


ft MAAAMA ft ft 

AGGGGTCAGA 


CTAACTTCAA 


ATTGAAATTG 


CCTTTTAATC 


1920 




TTTGGCTAAT 


AATACGATAG 


AGOGAAGATG 


AACTGATATA 


AAATTCTTTA 


1980 


CAAATACTCT 


CAGCTTGACA 


ACCTTCATTA 


AAGAAGATGA 


ATTCTAAAAT 


CGAAAAATGA 


2040 


GTTGAATGTT 


TAAAGAAATG 


m jVa^*^««a*ft m & 4m *m 

ATGGTAAACC 


ATTTCAATAT 


CACTATCATC 


GGTATTAATA 


2100 


ft fM/WMI% ft 

ATGCGTATAC 


CATTAGTAGA 


AGAATGAAAA 


ATCAAGTCAG 


GAAAAGCAGA 


TTTAACATGG 


2160 


% in ft ft hia h rth 

GATAGATCAT 


CTTTGACTGC 


ACGTTCTGTA 


CAATTTAATA 


ACTCTGCTAG 


TTCAGAACGA 


2220 


TGAAACCAAC 


GTTTATGTTC 


AAATAATAAT 


TCTAATAATT 


CTAATTGCCT 


ATGACTTTTT 


2280 


TTAGATAATA 


AATCTCTCAT 


GAATATCTTT 


CTCTCTTTAT 


AAATTATCGG 


ATTAAACCTC 


2340 


TTGCAATTAT 


ACCACAAAGA 


ATAGGTATAG 


CATGATATAA 


CGACTTTTCC 


TAAAATCTTT 


2400 


TATTTCGTAT 


AATAACACTA 


CGGAGACAAT 


ATATAAACAA 


TTTTCTTATT 


TTACCGTCTA 


2460 


TTGAGGGCGT 


GAATACAGAA 


TCAAATTCAA 


GTCTAAAGAT 


TATATTTTTA 


ATTTTAAAAA 


2520 


TTATATAATA 


GCAACAATTA 


AAGAATTTGA 


TTTTTTAAAA 


TTATATAATA 


ATAACAATCG 


2580 


AAATAATTGA 


CTTTTCTATA 


TTAAAGTTAT 


ATAATAGTAA 


TAATCAAAGA 


AATTGATTTT 


2640 


TTGATATTAA 


AATAAAAAAG 


GAGGGTAGGC 


AGTGTTGTGA 


TCAATTATTG 


CTGGAGGTCT 


2700 


TATTGGTCTC 


TTGGCAGGTA AAATCACTAA 


AAAAGTAGTT 


CTATGGGAAT 


CATCGCAAAT 


2760 


GTATTCGCTG 


GTTTAGTCGG 


GGCATATGCA 


GGACAATCTC 


TTTTAGGTAG 


TTGGGGTCCA 


2820 


GCAATCGCTG 


GAATGGCTTT 


GCTCCCATCT 


ATTGTAGGTG 


CAGCGATTGT 


GATTACTGTA 


2880


GTGTCATTCT 


TTACAGGTAG 


AAAGTAAACT 


TTTCGCCAGT 


AAAGTTAGCA 


AACTATTTTT 


2940 


AAATCAATGA 


CGGGAAAAAT 


AGTTTAAATG 


TTAAATCGAA 


AGGATTGTAT 


ATGTCAAAAC 


3000 


CAAAGAAAAT 


ATGTTTCATT 


ATTTTCTGTA 


TTTTAATCTT 


GACAATTTTC 


CTTCCTGTTT 


3060 


TGATAGATTA 


TCATCAAGTT 


AGTGATCTAG 


GTATTCATCT 


ACTTAGCTGG 


AGACAGAACT 


3120 


CCGTAGTTGA 


ATTCTATCTT 


GCTAGATATG 


TCTTTTGGGG 


GACAGTGGTT 


CTATCAACTT 


3180 


TAGTTTTATT 


ATCCATTTTA 


GTTGTGATGT 


TTTATCCTAA 


ACGTTACTTG 


GAAATCCAAC 


3240 


TTGAAACTAA 


AAACGATACA 


TTAAAATTAA 


AGAATTCGGC 


AATCGAAGGT 


TTTGTTAGAA 


3300 


GTTTGGTGAG 


TGATCATAGA 


TTGATCAAGA 


ACCCAACTGT 


TCATGTAAAT 


TTACGAAAAA 


3360 


ATAAATGTTT 


CGTTCATGTA 


GAAGGTAAAA 


TTCTTCCTTC 


AGACAACATC 


GCTGACAGAT 


3420 


GCCAAATAAT 


TCAAAATGAA 


ATAACTAATG 


GATTGAAGCA 


GTTTTTTGGT 


ATTGAGCGTC 


3480 


AAGTAAAACT 


TGAAGTTGCA 


GTAAAAAATT 


ACCAACCAAA 


ACCTCAAAAC 


AAAAAGACTG 


3540 



WO 98/18931 



PCT/US97/19588 



1140 



TTAGTCGTGT 


GAAGTAAGGA 


AGTAAAAAAT 


GGAATGGCTT 


AAACAATATC 


GATATCCAAT 


3600 


TATCGCTGGT 


CTCATAGGCG 


TATTTCTGGC 


TTGTTTGATT 


GTCTCCTTTG 


GCTTCTTCAA 


3660 


AACAATATTT 


GTATTGATTT 


TAGGAGCACT 


GGGAGTTGCA 


GCTGGATTAT 


ATATCGAAAA 


3720 


AAACTATATA 


GATAAATAAA 


AAAATAAAAA 


TTACTAATTT 


AATTAAAGGA 


GTTTCATATG 


3780 


TCAAACGAAA 


AAAACACAAA 


CACTAACGTA 


GAAAAGAAAG 


ATGCTACTGT 


TGTAGCTCAC 


3840 


GAAATCAAAG 


GGGAACTTAC 


TTACGAAGAT 


AAAGTTATCC 


AAAAAATCAT 


TGGTCTTTCA 


3900 


CTAGAAAACG 


TTTCAGGTCT 


TTTGGGAATC 


GATGGTGGTT 


TCTTCTCAAA 


TCTTAAAGAA 


3960 


AAAATCGTTA 


ACAGCGATGA 


CGTAACAAGT 


GGTGTTAACG 


TAGAAGTTGG 


TAAAACACAA 


4020 


GTTGCAGTTG 


ACTTAAACGT 


TATTGTTGAG 


TACCAAAAAA 


ATGTTCCAGC 


TTTATATTCA


4080 


GAAATCAGAG 


AAATCGTATC 


TTCAGAAGTT 


GCTAAAATGA 


CTGACTTGGA 


AATTGTTGAA 


4140 


ATCAACGTAA 


ACGTTGTCGA 


CATCAAAACT 


AAAGAACAGC 


ATGAAGCAGA 


CTCAGTAAGC 


4200 


CTTCAAGATC 


GCGTATCTGA 


CGTTGCTGAA 


TCAACAGGAG 


AATTCACTTC 


AGAACAATTC 


4260 


GAAAAAGCTA 


AATCTGGTCT 


TGGATCTGGT 


TTCTCAACTG 


TTCAAGAAAA 


AGTTAGCGAA 


4320 


GGTGTAGAAG 


CTGTTAAAGG 


TGCAGCAAAT 


GGTGTAGTAT 


CTCACGAAAA 


CACTCGTGTA 


4380 


AACTAAGATA 


AAATAAATAT 


AACAGGAGAA 


ATTATCATGT 


CAGTAGAAGA 


AAAATTAAAT 


4440 


CAAGCTAAAG 


GTTCTATTAA 


AGAAGGTGTT 


GGGAAAGCCA 


TCGGTGATGA 


AAAAATGGAA 


4500 


AAAGAAGGTG 


CAGCTGAAAA 


AGTTGTTTCT 


AAAGTAAAAG 


AAGTTGCCGA 


AGACGCTAAA 


4560 


GACGCTGTAG 


AAGGTGCTGT 


AGAAGGTGTT 


AAAAACATGT 


TGAGTGGCGA 


CGATAAATAA 


4620 


GGTTAAAAGT 


TACTTTATCT 


TTTTAGTAAT 


ATTAGTCAAA 


AGAGTCTGAG 


TCAAGATGAT 


4680 


TCTCAGAAAA 


CAAAAAGCTA 


GAGATTCCCA 


ATTGCGGAAC 


TCTAGCTTTT 


TAATTTTGCC 


4740 


TCTTTCTCTT 


ATTATATTTC 


AGCAGGTTGT 


TGGCCATGAG 


TACGAATCCC 


ATGTCAATTC 


4800 


TCACTTGACG 


CTTACCTCTC 


AGATGACATC 


TCTTATAACC 


CAAACAAACC 


TTTATCTGCC 


4860 


CAAAGACAGA 


TTTCATATCA 


ATCTTACGTT 


TAGCGAAAAT 


TTGTCTACCC 


TTGGAAGATA 


4920 


AAAGTGCCTG 


ATATTCTTTA 


GTTTTTAAAC 


ACTGGTAACG 


TTCATTCATA 


TACAGTCTCT 


4980 


TTTGAGGGGC 


TGATTCAGGT 


TCATAATCGC 


AGTCAACATT 


GATTTCAAGG 


CTGTTTGCTT 


5040 


TCTATCTCCC 


COG 










5053 


(2) INFORMATION FOR SEQ ID NO: 188:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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<xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 188:






AATTCTCTTT 


TTTCCAACAA 


AATGTATGAC 


CTGCACTTCA 


ATACTTCTCA 


TTGTTTGTAC 


60 


ATTCATCTAC 


TTTCATATAA 


TCTTTTACAA 


AATCATAATA 


TGACATAACA 


CACTATCCCT 


120 


TTTAGACAAT 


ATTCCAATTA 


GCCTTATTAA 


TTCAAAACTA 


TTGTATTAGT 


AATTATAACA 


180 


GATGTATAAT 


AGAAAAGCAA 


TGATAGATAT 


TATCAATTAA 


GCGAATTTAT 


ATCTAAAAGG 


240 


GATATTAAAG 


AAAGGAGATA 


TGCTTATGAA 


GATTTACAAA 


AAACTATTTG 


CTTATGTCCA 


300 


AGATAAGAAA 


TATCTTCGGG 


TTTTGGCCAT 


AATTTTTTCT 


GCTATATCTC 


CTGCACTTAC 


360 


AGTATATGGA 


TATTATTTAA 


TCTACAAATT 


TCTAGATAAG 


TTAATAATTA 


ATTCAAACTT 


420 


ATCCGGTGCA 


GAGAGTATAG 


CATTAAAATC 


TGTTATTACA 


CTAACAAGTG 


GAGCGATATT 


480 


TTATTTTGTC 


TCAGGAATGT 


TTTCACATAT 


CTTGGGATTC 


AGGCTTGAAA 


CAAATTTAAG 


540 


AAAAAGGGaA 


TCGATGGTCT 


GGAAAAAGCA 


AGTTTTAGGT 


TCTTTGACTT 


AAATCCATCT 


600 


GGTCAAATAA GAAAGATTAT 


AGATGPCAAT 


GCTGCACAAA 


CTCATCAGGT 


GGTAGCACAC 


660 


ATGATTCCCG 


ATAGTTCTCA 


GGCAATAATC 


ACACCCGTAC 


TTGTACTTGC 


ACTTGGCTTT 


720 


ATAGTAAGTA 


TAAGAGTTGG 


CATAATTTTG 


CTTGCTCTTA 


CTATAATTGG 


TGGCTTAATT 


780 


TTAGGGGCAA 


TGATGGGCGA 


GCAAGAATTT 


ATGAAGATAT 


ACCAAGAATC 


CCTATCTAAA 


840 


CTAAGTGCTG 


AAACTGTTGA 


GTACGTGAGA 


GGAATGCAAG 


TTGTAAAAAT 


ATTTAAAGCA 


900 


AATGTAGAGT 


CTTTTAAAAG 


CTTTTATAAG 


GCGATAAAAG 


ATTACTCAAA 


GTATGCTTAT 


960 


GATTATTCCC 


TATCTTGTAA 


AAGGCCTTAT 


GTTTTGTATC 


AATGGTTATT 


TTTTGGACTG 


1020 


ATTGCAATTT 


TAATTATTCC 


TATAGTTTAT 


1 X 1 A luAw 1 A 


CLf'WtLr'f^'V x /*• 
W- 1 I /UjL L Au 




1080


ATTTTACTivi 


AGCTTATCA? 


fJi'.iTTTATTT 


TTATCACGAG 


TTCTCTTTGT 


TTCATTCATG 


1140 


AGAATGATGT 


GtACTCCATG 


TATATTTCTC 


AAGGAAATTA 


TGCAGTAGAT 


ACTTTAGAGG 


1200 


CGCTTTACGA 


AGATATGCAA 


AAAGACAAAT 


TAGTGCATGG 


TAATGTCAAT 


AATTTTAAAA 


1260 


ACTATAATAT 


AGAATTTGAG 


AATGTTAGCT 


TTGCTTATAA 


TGATAAAGCT 


GTCATTGAAA 


1320 


ATTTATCCTT 


TAATTTAGAA 


GAAGGAAAGT 


CCTACGCACT 


TGTCGGTTCA 


TCTGGATCAG 


1380 


GCAAATCAAC 


AGTAGCAAAA 


CTTATATCAG 


GTTTTTACAA 


TGTTAATAAA 


GGAAGCATAA 


1440 


AGATAGGCGG 


GATAGCAATA 


AGTGAATATT 


CTGACGAAGC 


CTTAATTAAA 


GCCATTTCCT 


1500 


TTGTTTTTCA 


AGATTCAAAA 


TTATTCAAGA 


AGAGCATTTA 


TGATAATGTA 


GCGTTAGCTA 


1560 


ATAAAGATGC 


GACGAAAGAT 


GACGTTATGA 


GAGCCTTAAA 


ATTAGCAGGA 


TGCGATTTAA 


1620 
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TATTAGACAA 


ATTCCCAGAA 


AGAGAAAATA 


CAATCATAGG 


CTCAAAAGGT 


GTTTATTTAT 



1680 



CCGGTGGAGA 


AAAACAAAGA 


ATTGCAATTG 


CTAGAGCAAT 


TTTAAAGGAT 


TCCAAAATTA 


1740 


TTATTATGCA 


TGAAGCATCA 


GCATCTATTG 


ACCCAGATAA 


CGAGTTTGAA 


TTGCAAAAAG 


1800 



CTTTTAAAAA 


TCTTATGAAG 


GATAAAACAG 


TTATCATGAT 


TGCACACAGG 


CTATCTACAA 



1860 



TTAAAGACCT 


TGATGAAATT 


ATTGTCATGG 


ATAGTGGAAA 


AATTATAGAA 


AGAGGGTCTG 


1920


ACAAAGAATT 


AATGTCAAAA 


GATACAAGGT 


ATAAGAGCCT 


GCAAGAGATG 



TTTAACAGTG 

1980




CGAATCAATG 


GAGGGTTTCA 


AATGAAAGAG 


TTTTATAAAA 


AAAGATTTGC 


TCTTACAfiAT 

2040




GGAGGAGCAA 


GAAATTTAAG 


TAAAGCAACA 


CTGGCTTCAT 


TTTTCGTTTA 



TTGTATAAAP 

2100


ATGCTTCCTG 


CCATATTACT 


TATGATTTTT 


GCTCAGGAAG 


TTTTGGAAAA 



TATCRfiTil A 


2160


AGCAATGGCT 


TTTATATAGT 


ATTCTCAGTT 


TTGATTTTGA 


TAGCAATGTA 


tnl I 1 1 1 * 




TCTATCGAAT 


ACGATAAATT 


ATATAACACA 


ACCTATCAAG 


AAAGTGCAGA 


1 1 I t*J\\M\/\ I J\ 




AGGACAGCGG 


AGAATTTATC


AAAATTACCT 


CTATCTTACT 



TTTCT 1 AA A A 


l onl— n 1 a 


2340


GACATTTCAC 


AAACAATCAT 


GGCTGATATT 


GAAGGCATAG 


AGCATC5PAAT 


fiAfll'V Af"IVa 


2400


ATACCAAAGG 


TGGGCGGCAT 


GGTACTGTTT 


TTCCCATTAA 


TATCIY5TAAT 






GGCAATGTCA 


AGATGGGTTT 


AGCTGTAATT 


ATTCCATCTA 


TTTTAAGCTT 


TATATTTATA 


2520


CCTTTATCTA 


AAAAATATCA 


GGTTAATGGA 


CAGAATAGAT 


ATTATGATGT 


CTTAAGAAAA 

2580




AACTCAGAAA 


GCTTTCAAGA 


AAATATCGAA 


ATGCAAATGG 


AGATTAAAGC 


ATATAATTTA 

2640




TCGAAGGATA 


TTAAAGATGA 


CTTATATAAA 


AAAATGGAAG 


ATAGTGAGAA 


AGTACACTTA 


2700


AAGGCGGAAG 


TAACTACAAT 


TTTAACTTTG 


TCTATATCTT 



CAATATTTAG 


CTTTATATCT 


2760


CTTGCTGTTG 


TGATATTTGT 


CGGCGTAAAT 


CTAATTATTA 


ATAAAGAGAT 


AAATTCTCTC 

2820




TACCTTATAG 


GATATTTACT 


AGCTGCTATG 


AAGATAACAG 


ACTCTTTAGA 


TGCATCTAAA 


2880


GAGGGCTTGA 


TGGAAATATT 


TTATTTATCG 


CCCAAAATAG 


AAAGATTAAA AGAAATTCAA 

2940




AATCAAGATT 


TACAAGAAGG 


CGATGACTAT 


AGCTTAAAAA 


AATTTGATAT 


TGATCTAAAA 


3000 


GATGTTGAGT 


TTGCCTACAA 


TAAAGACGCA 


AAAGTTTTAA 


ATGGTGTAAG 


TTTTAAAGCT 


3060 


AAGCAGGGAG 


AGGTCACTGC 


TTTGGTAGGT 


GCAAGTGGCT 


GCGGTAAAAC 


AACTATCTTG 


3120 


AAACTTATAT 


CAAGACTTPA 


TGATTATGAC 


AAGGGACAAA 


TCTTAATCGA 


TGGCAAAGAT 


3180 


ATAAAGGAAA 


TATCAACAGA 


ATCCCTTTTT 


GATAAGGTGT 


CTATTGTTTT 


CCAAGATGTG 


3240 


GTTCTCTTTA 


ATCAAAGCGT 


TATGGAAAAT ATTAGAATCG 


GTAAGCAAGA 


TGCAAGTGAC 


3300 


GAAGAGGTTA 


AAAGAGCAGC 


AAAACTTGCA AATTGCACAG 


ATTTTATAGA 


AAAAATGGAT 


3360 


AAAGGTTTCG 


ATACAGTTAT 


TGGTGAAAAC 


GGAGCTGAGC 


TATCAGGAGG 


AGAAAGACAA 


3420 
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AGATTATCAA 


TAGCCAGAGC 


CTTCTTAAAA 


GATGCGCCGA 


TATTGATCTT 


AGATGAGATA 


3480 


ACAGCAAGCC 


TTGATGTTAA 


CAACGAGAAA 


AAGATTCAAG 


AGTCTTTAAA 


TAATTTAGTT 


3540 


AAAGATAAAA 


CTGTTGTAAT 


CATTTCACAT 


AGAATGAAAT 


CCATAGAAAA 


TGCAGACAAG 


3600 


ATAGTAGTTC 


TTCAAAACGG 


AAGAGTAGAA 


AGCGAAGGTA 


AGCATGAAGA 


GCTTTTACAA 


3660 


AAATCAAAAA 


TTTACAAAAA 


TTTAATAGAA 


AAGACAAAAA 


TGGCAGAAGA 


ATTTATTTAT 


3720 


TAGGAGGACT 


ACAATGGATA 


ATAAAAAATT 


AAAAGTAAAA 


GATTTACTAA 


GCATCGGTGT 


3780 


TTTTGGCGTA ATTTATTTTG 


CCTTCATGTT 


TGGAGTTGGT 


ATCATGGGCT 


TGATTCCAAT 


3840 


ATTGTTCTTA 


ATATACCCGA 


CAGTATTAGC 


CATAGTTGCA 


GGAACTGTTG 


TTATGTTATT 


3900 


TATGGCTAAG 


GTTCAAAAGC 


CATGGGCACT 


ATTTATATTT 


GGTATGATAT 


CACCACTTGT 


3960 


GATGTTTGCA 


GCTGGTCATA 


CCTACGTAGT 


TGTGGTTTTA 


TCACTTATAG 


TAATGATAAT 


4020 


AGCAGAATTA 


ATTAGAAAGA 


TTGGTAATTA 


TAATTCATTT 


AAATACAATA 


TGCTTTCTTA 


4080 


TGCAATCTTC AGCACATgGA TATGTAGCTC TTTAATGCAA ATGCTTTTAG CAAAAGAAAA 


4140 


ATATATGGAG 


TGGTCTTTGA 


TGACTATGGG 


AAAAGATTAT 


GTTGATGTAT 


TAGAAAAGTT 


4200 


AATAACTTAT 


CCTCACATGG 


CTTTAGTAGC 


CTTAGGTGCT 


TTCTTAGGAG 


GAATTCTTGG 


4260 


AGCATATATA 


GGCAAGGCTC 


TATTGAAAAA 


ACACTTTTCA 


AATGGATTAT 


ATTGTGTGGG 


4320 


ATACTTTACT 


CCTTGCCTAA 


TTTTATGGTG 


CTATCTGAAT 


TAAACCCTAT 


AGTTAAGATG 


4380 


TTTTTGAGTA 


TACCTATTGT 


TATTAGAATG 


TTTATTTTAC 


CATTTATGGC 


AGCAAGCTTT 


4440 


ATGATAAAGA 


CCTCGCATGT 


AGGCGCAATA 


ATTTCATCGA 


TGGATAAGCT 


TAAGATTTCA 


4500 


AAGAATGTAT 


CCATACCTAT 


TGCGGTTATG 


TTTAGATTCT 


TCCCATCTTT 


TAAGGAGGAG 


4560


AAGAAAAACA 


TCAAAATGGC 


TATGAGAGTA AGAGGGATAA ATTTTAAAAA CCCAGTCAAA 


4620 


TATCTTGAAT 


ATGTTTCTGT 


GCCACTACTC 


ATTATATCAT 


CTAATATATC 


AGATGACATT 


4680 


GCAAAAGCGG 


CAGAAACAAA 


GGCAATAGAA 


AATCCAATTG 


CCAAGACCAG 


ATACATTCGC 


4740 


GTAAAGATAC 


AGCTAATTGA 


TTTTGTTTAT 


GTTTTAGCGG 


TTGCTGGACT 


TATTGTGGGA 


4800 


GGCTTAATAT 


GGTTGAAATA 


AAAAATTTAA 


GTCTTGATTA 


TGGTGAAGAG 


CATATATTAG 


4860 


ATGATATATC 


ACTATCCATA 


GCCGAGGGAG 


AGTGCGTGCT 


ATTTACAGGA 


AAAAGTGGAA 


4920 


ATGGTAAGTC 


ATCTTTAATA 


AATTCAATCA 


ATGGACTAGC 


TGTAAGGTAT 


GATAACGCAA 


4980 


AGACAAAGGG 


CGAAATAATT 


ATTGATGGTA 


AGAATATAAA 


AAATTTGGAA 


CTTTATCAAA 


5040


TCTCAATGCT 


TGTTTCAACT 


GTTTTTCAAA 


ATCCTAAGAC 


ATATTTTTTT 


AATGTCAATA 


5100 


CGACATTAGA 


ATTATTATTT 


TATTTCGAAA 


ATATCGGTCT 


TGCAAGAGAA 


GAGATGGACA 


5160 
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GGCGTTTGAA GGATATACTT GAGATATTCC CGATAAAAAA TCTTTTGAAC AGAAATATAT 5220 

TTAATCTATC CGGCGGTGAA AAACAAATTC TTTGCATTGC AGCTTCTTAT ATAGCAGGTA 5280 

CAAAGATTAT AGTTATGGAT GAGCCTTCAT CGAATTTAGA TATTAAAAGC ATAAGTGTTT 5340 

TGGCAAAGAT GCTAAAGATA TTAAAAGAGA AAGGCATAAG CATAATTGTT GCAGAGCATA 5400 

GAATTTATTA TTTGATGGAC ATAGTTGACC GTGTATTTTT AATAGATAAA GGAAAGCTTA 5460 

AAAAAACTTA TACTAGAAGT GAATTTTTAA AGCTAGATAA AAATGAATTA AATGCTTTAA 5520 

GTTTAAGAGA TAAAGAATTA AGTAAATTAA AAGTTCCTTA TTTAAAAGAA GGTGGAGAGT 5580 

ATCAGATAAA AAATCTTAGT TACAAATTTA CTGATGATGA GTGTTTAAGC TTAAAAGATA 5640 

TTTCGTTCAA GCTTGGGAAA ATTTATGGCA TAATAGGATC CAACGGACGA GGAAAATCAA 5700 

CGCTTTTAAG ATGTTTAATA GGTCTTGAGA AAAAATCAAA AGAAGAAATT TATTTTAAGG 5760 

GAGAGAAGCT ATCTAAAAAA GAAAGACTCA AAAACTCTTC ACTTGTTATG CAAGATGTAA 5820 

ATCATCAATT ATTCACAGAT GAAGTATTCA ACGAGCTTAG ATTAGGAGTA AAGAATTTTG 5880 

ATGAAGAAAA GGCGAAAATC ATTTTAAACC CCAATTATTC ACCCCAAATC TAAAAACCAT 5940 

CCAGAATCCT TGCCTTAGCT TAGATCCTGG ATGGTTTCTT TTTTCACCCA ATGGGTGTTT 6000 

TTTACTAGAC AAAAAAGAGT TTCCCCTTTA TGGTATAAGT GTAGAAAAAA ACACAAAAAG 6060 

AAAGGAAACT CACATGAACA GTTTACCAAA TCATCACTTC CAAAACAAGT CTTTTTACCA 6120 

ACTATCTTTC GATGGAGGTC ATTTAACCCA CTATGGTGGT CTTATCTTTT TTCAGGAACT 6180 

TTTTTCCCAG TTGAAACTAA AAGAGCGGAT TTCTAAGTAT TTAGTAACGA ATGACCAACG 6240 

CCGCTACTGT CGTTATTCGG ATTCAGATAT CCTTGTCCAG TTCCTCTTTC AACTGTTAAC 6300 

AGGTTATGGA ACGGACTATG CTTGTAAAGA ATTGTCAGCT GATGCCTACT TTCCAAAATT 6360 

GTTGGAAGGA GGGCAGCTTG TTCACAGCCA ACCTTATCCC GTTTTCTTTC CAGAACTGAC 6420 

GAGGAAACAG TCCATAGTTT GCGATGCCTC AACCTTGAAT TGGTCGAATT CTTTTTACAT 6480 

GTTCACCAGC TG 6492 
(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

AACTGAAGGT AAAGGCTTCG ACGCAGAACG TGACGCTGCC CAAGCTGCCC TTGATGACCT 60 
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TAAGAAAGCT CAAGAAGACA ACAACTTGGA CGACATGAAA ACAAAACTTG AAGCATTGAA 120 

CGAAAAAGCT CAAGGACTTG CTGTTAAACT CTACGAACAA GCCGCAGCAG CGCAACAAGC 180 

TCAAGAAGGA GCAGAAGGCG CACAAGCAAC AGGGAACGCA GGCGATGACG TCGTAGACGG 240 

AGAGTTTACG GAAAAGTAAG ATGAGTGTAT TGGATGAAGA GTATCTAAAA AATACACGAA 300 

AAGTTTATAA TGATTTTTGT AATCAAGCTG ATAACTATAG AACATCAAAA GATTTTATTG 360 

ATAATATTCC AATAGAATAT TTAGCTAGAT ATAGAGAATT ATATTAGCTG AACATGATAG 420 

TTGTATCAAA AATGATGAAG CGGTAAGGAA TTTTGTTACC TCAGTATTGT TGTCTGCATT 480 

TGTATCGGCG ATGGTACCAG CTATGATATC ATTAGAAATA CAAACATATA AATTTGTAAT 540 

ACCGTTCATA ATTGGTATGA TTTGGACAGT AGTTGTATTT CTTATGATCA ATTGGAATTA 600 

TATAGGCAAA TACTAAGAAG AGACAAAAAT ATATAAATAT TTCTGTACTT ATAGGATATT 660 

TAAAATCAAA ATAAAGTTAA TTTACTTATT TGCAGAGGTT GCAACCCAGC CTCTGTTTTT 720 

CGATAAAAAG GGACGGAATC TCATTTGTTT GGGTTTTGTC TCATCAATAG AAAGGAACAA 780 

AGAGTGTTCG TAACTGAACA CGGGTTTCAG AATTTCTTAC TAAATATAAA AGAAAGGAAT 840 

TGAACCCGAC CTAAATGGTG GTTCGATTCA GAACATCAAT AGAAAGGAAT AAGGGTGTTC 900 

GTAACTGAAC ACGGGCTATG GACTGTGCCA AAAAGATAGT TTTTTCTAGG ACGTAAGCGT 960 

CCGTCGTCAA AACTCCTAGA TGGCTGTGTC CGTTTGACGC CCTTTGTATC TTGAATTATG 1020 

AACAATACTG AATTTTATGA TCGTCTGGGG GTATCCAAAA ACGCTTCGGC AGACGAAATC 1080 

AAAAAGGCTT ATCGTAAGCT TTCCAAAAAA TATCACCCAG ATATCAACAA GGAGCCTGGT 1140 

GCTGAGGACA AGTACAAGGA AGTTCAAGAA GCCTATGAGA CTTTGAGTGA CGACCAAAAA 1200 

CGTGCTGCCT ATGACCAGTA TGGTGCTGCA GGCGCCAATG GTGGTTTTGG TGGAGCTGGT 1260 

GGTTTCGGCG GTTTCAATGG GGCAGGTGGC TTCGGTGGTT TTGAGGATAT TTTCTCAAGT 1320 

TTCTTCGGCG GAGGCGGTTC TTCGCGCAAT CCAAACGCTC CTCGCCAAGG AGATGATCTC 1380 

CAGTATCGTG TCAATTTGAC CTTTGAAGAA GCTATCTTCG GAACTGAGAA GGAAGTTAAG 1440 

TATCATCGTG AAGCTGGCTG TCGTACATGT AATGGATCTG GTGCTAAGCC AGGGACAAGT 1500 

CCAGTCACTT GTGGAGGCTG TCATGGCGCT GGTGTCATTA ACGTCGATAC GCAGACTCCT 1560 

CTTGGTATGA TGCGTCGCCA AGTAACCTGT GATGTCTGTC ACGGTCGAGG AAAAGAAATC 1620 

AAATATCCAT GTACAACCTG TCATGGAACA GGTCATGAGA AACAAGCTCA TAGCGTACAT 1680 

GTGAAAATCC CTGCTGGTGT GGAAACAGGT CAACAAATTC GCCTCGCTGG TCAAGGTGAA 1740 

GCAGGCTTTA ACGGTGGACC TTATGGTGAC TTGTATGTAG TAGTTTCTGT GGAAGCTAGC 1800 
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GACAAGTTTC AACGTGAAGG AACGACTATC TTCTACAATC TCAACCTCAA CTTTGTCCAA 
GCGGCTCTTG GTGATACAGT AGATATTCCA ACTGTTCACG GTGATGTTGA ATTGGTTATT 
CCAGAGGGAA CTCAGACTCG TAAGAAGTTC CGCCTACGTA GTAAGGGGGC ACCGAGCCTT 
CGTGGCGGTG CAGTTGGTGA CCAATACGTT ACTGTTAATG TCGTAACACC GACAGGCTTG 
AACGACCGCC AAAAAGTAGC CTTGAAAGAA TTCGCGGCTG CTGGTGACTT GAAAGTAAAT 
CCAAAGAAAA AAGGCTTCTT TGACCATATT AAAGATGCCT TTGATGGAGA ATAATACTCT 
TCGAAAATCT CTTCAAACCA CGTCAGCGTT GCCTTGCCGT ATATATGTGA CTGACTTCGT 
CAGTCGTATC TACAACCTCA AAACAGTGTT TTGAGCAGCC CGTGGCTAGT TTCCTAGTTT 
GCTTTTTACT TTATAGATTT TTTAAGACTT TCCTAAGTAA TGACGGACGG TAGTGACCTC 
CTTCGAAGTT CCATACCTAA ACTTTGAACC TAAGTTTTAA AGTTTCCGGA CAGCTGAAAC 
CAAGCTGTTT CAGGTGTTTT CATTACGGCA GAAAGTCTTC GATTTAGTTG TGAAATGGTG 
AATGATACTC TTCAAAAATT TCTTCAAACC ACGTCAGCGT CGGCTTGTCA TGGGTATGGT 
TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTGAGCTGA CTTCGTCAGT 
TCTATCCACA ACCTTAAAAC GGTGTTTTGA GCAGTCTGTG CCTAGCTTTC TAGTTTGCTT 
TTTGATTTTT ATTGAGTATG AATTACCTAA ATTATGATGC ATAGTTGATG GGATATATAT 
AATAGATTGA AATAGAATAT GAACAAATTG ATAAGAGGAT TTTAAAGTAA TCTCTAACAA 
TGCn-fAGAA ACTATGGTGT GCTATTCTAA ATTCAATTCA CTATAACTTG TTTACGTTTT 
AAAAAAGAGC CGTCGGGCTC TTTTTACTTA TCTTCAGTTC CCTGCATTTC TTTTATCACA 
GCTAGTCTAG TCTGGATATC CTTTTCCAAG ACCTTAAACT TGTAAGTCAA GTCTTCTTGG 
TATTCCTTGA TAAGTTCTTT TTGCTGGTTA ATGATTTGCA GGCTGTTTTG GATAATATCC 
ACATCGTCCT TGATAGCTTG AACGCGGTCA GTGGTATTCA AGACTTCATC TGTGATGGTT 
TGGCGATTTT TTGTAACCAG ataacttccg gctgcagctc CTGCAAATAG CAGTAGGTTG 
GATAATTTCA TAGCAACTCC TTAAGCGTTT TTGATGGTTT CAGCCACTTG AGCAAGTTTG 
TCAAAGTCTG GTTCGTGGGC GATAAAATCA ATCTTGAGGT CATCGTCAGC ACTGTAGCGA 
GGCACAAGGT GAACGTGAGT ATGAAAAACT GTTTGACCAG CGACTTCTTC ACAGTTGGAA 
ATGATATTCA TACCAGCAGC CTTAGTGACT TTCATGACTT TTTGAGCTAC TTTTGGTACT 
TGGCCAAAGA GTTGGcTGGC GCTCGTAGCA TCCATCTCCA AAAGATTGCG ATAGTGTTCT 
TTTGGCACGA CCAAGGTGTG TCCTAGTGTT ACTTGAGAGA TATCAAGAAA GGCAAGGACC 
TGCTCATCTT CATATACTTT TGAAGCAGGA ATTTCCCCTG CGATGATTTT ACAAAAAATG 
CAATCTGACA TAAAATCTAC CTCTACTGTA CTGAATTTTG ATATAATATA GCTACATTAT 
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ACCAGATTTG GAGAAAATAT GTTAGAAATT AAAAACCTGA CAGGTGGCTA TGTTCATGTT 3660 

CCTGTTTTGA AAGATGTGTC CTTTACTGTT GAAAGTGGGC AGTTGGTCGG TTTGATTGGT 3720 

CTCAATGGTG CTGGGAAATC AACGACGATC AATGAGATTA TCGGTCTGTT GGCACCTTAT 3780 

AGTGGCTCCA TCAATATCAA TGGCCTGACT CTGCAAGGAG ATGCGACTAG CTACCGCAAG 3840

CAGATTGGCT ACATTCCTGA GACGCCTAGT CTGTATGAGG AATTGACCCT CAGAGAGCAT 3900 

ATCGAAACGG TTGCTATGGC TTACGGTATT GAGCAAAAAG TGGCTTTCGA ACGAGTAGAG 3960 

CCCTTGTTAA AAATGTTCCG TTTGGAACAG AAATTAGACT GGTTCCCTGT TCATTTTTCA 4020 

AAAGGGATGA AGCAGAAGGT CATGATTATC TGTGCTTTTG TGGTGGATCC AAGTCTTTTC 4080 

ATCGTGGATG AGCCTTTCCT TGGTCTTGAT CCGCTGGCTA TTTCTGATTT GATTCAGCTT 4140 

TTGGAAGTGG AGAAGCAAAA GGGCAAGTCT ATTCTCATGA GTACCCACGT GCTGGATTCG 4200 

GCGGAGAAGA TGTGTGATGC CTTTGTCATT CTTCACAAGG GAGAGGTGCG TTCCAAAGGC 4260 

AATCTCCTGC AACTACGTGA AGCCTTTGAT ATGCCTGAGG CTAGTTTGAA TGATATTTAC 4320 

TTGGCTCTGA CCAAAGAGGA GGATCTATGA AAGACTTGTT TTTAAAGAGA AAGCAGGCCT 4380 

TTCGTAAGGA GTGTCTTGGT TATCTGCGCT ATGTGCTCAA TGACCACTTT GTCTTGTTCC 4440 

TGCTTGTCCT GTTGGGCTTT CTAGCCTACC AGTACAGTCA ACTCTTACAA CATTTTCCTG 4500

AAAATCATTG GCCTATCCTT TTGTTTGTAG GAATTACGTC TGTTTTACTT TTACTTTGGG 4560

GAGGAACTGC CACCTATATG GAGGCTCCAG ACAAGCTCTT TCTCTTAGTT GGAGAAGAGG 4620 

AAATTAAGCT CCATCTCAAG CGTCAAACTG GCATTTCCCT AGTCTTTTGG CTCTTTGTAC 4680 

AGACCCTTTT CTTGCTGTTA TTTGCGCCTT TATTTTTAGC AATGGGTTAT GGCTTGCCAG 4740 

TTTTTCTGCT CTATGTGCTT TTATTGGGGG TAGGAAAATA TTTCCACTTT TGTCAAAAGG 4800 

CCAGCAAATT TTTCACTGAA ACTGGACTGG ACTGGGACTA TGTTATTTCT CAAGAAAGGA 4860 

AGCGTAAGCA AGTCTTGCTT CGTTTCTTTG CCCTCTTTAC GCAGGTCAAG GGAATTTCAA 4920 

ACAGCGTTAA GCGTCGTGCC TATCTGGACT TTATTTTAAA GGCTGTTCAG AAGGTGCCTG 4980

GGAAGATTTG GCAAAATCTC TATCTGCGTT CTTATCTGCG AAATGGCGAC CTCTTTGCTC 5040 

TCAGTCTTCG TCTTCTCTTG CTTTCCTTGC TGGCGCAGGT TTTTATCGAG CAAGCTTGGA 5100 

TTGCGACAGC AGTGGTAGTT CTCTTTAACT ACCTCTTGCT CTTCCAGTTG CTGGCCCTCT 5160 

ATCATGCCTT TGACTACCAG TATTTGACCC AACTCTTTCC GCTGGACAAG GGGCAAAAGG 5220 

AAAAAGGCTT ACAGGAGGTA GTTCGAGGAT TGACCAGTTT TGTTTTACTT GTGGAATTAG 5280 

TTGTTGGGTT GATTACCTTC CAAGAAAAAC TAGCCCTTCT AGCCTTACTA GGAGCTGGTT 5340 
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TGGTTTTACT AGTCTTGTAT TTGCCTTATC AGGTAAAACG TCAGATGCAG GACTAACATT 5400 

GCTGATACGA CACTAAAAAA GAAGTTGAGT TCAGTCTGTC TCAACTTCTT TTTTCTTACT 5460 

ACAGGATAAT GGTTGGTCCG TAGAGACTTA TACTCTTCGA AAATCTCTTC AAACCACGTC 5520 

AGCGTCGTCT TACCGTACTC AAGTACAGCT TGCGCCTAGC TTCCTAGTTT GCTCTTTGAT 5580

TTTCATTGAG TATTAACTTG GTCTTGACTT GGTCAAAGTG GAAGCGGTCA TAGGCCCGCC 5640 

AAGCGGCGCG AGTTGGAGCA TCTGGATCAA GAGCGCTGAG TCCCATGAGA AGACTGGAAG 5700 

TCTGGTAAAA TTTTTCTAGT TCAATCAAGA ATCGATTATC CACTGTTTCA GCCTTGGCTA 5760 

GAAAACCAAG AATAGAGTTT AATTGCTCCT GAAAGCGGAC GTCGTCAGCG CTTGCCTGTT 5820 

TGCATGCTTG GTAGGCTTTG TTTAAGTCAG TAATCAAAGT ATGAGCTCTT TTGATGGGGT 5880 

CTGTATCTGT CATGGGAATG CCTCCTTTAA TCTGGCTCCC AGTCTTACTT CTGGCAACTG 5940 

TGTTTTGATA CTGTTAGTTT ATCACTTTTA ATTCTTTTTT TTTATTCAAA TCTTTAATTG 6000 

TCATTGAAAT GTCTTGAATT GCGCTGAGTG AATTTTATCA TAAAATAGTT GTAAGCTCAT 6060 

CATGATGTTG TAGAAAATAA TCCTTTTAGG AGTTTTCAAA GACTGTTTAG GATTGGGTGT 6120 

GCTTGGGCTA GACCTTTTCT GTTATTCTTT TCTTAGGAGG AGAATCCAAT GAAATATATG 6180 

ATTATTCAGA CGCAGAAAAC AGTCTATAAA GTAAACATCG ACGATATCTA CTATATCCAA 6240 

ACACATCCAA CTAAAGCCCA TACCGTACAG ATTGTTACAG AAGAAGCTAG TTTTAATATG 6300 

CTTCAAAATT TAAGTAATCT TGAGAACCAA TGTGGGGAAA CCTTGATGAG ATGTCATCGA 6360 

AATTGTTTGG TTAATCTTGA TAAATTAAAA TCGATTGATT TTCAAGAAAG AATCCTTTTT 6420 

CTCGGAGAAG AAGGTCAATA CGCTGTCAAG TATGCCAGAC GTCGCTATAG AGAAATTCGT 6480

CAAAAATGGT TGAAAGAGGG AGAGTAAGAA GATGAGAATA TTTGTTTTAG AGGATGATTT 6540 

TTCCCAACAG ACTAGAATTG AAACGACGAT TGAGAAACTT TTGAAAGCAC ATCATATCAT 6600 

TCCTAGCTCT TTTGAGGTAT TTGGCAAGCC GGACCAACTG CTGGCTGAAG TGCATGAGAA 6660 

GGGGGCCCAT CAGCTATTCT TTTTGGATAT TGAGATTCGA AATGAAGAGA TGAAGGGACT 6720 

GGAAGTCGCT AGAAAGATTC GGGATCGGGA TCCTTATGCC CTGATTGTCT TTGTGACGAC 6780 

TCACTCGGAG TTTATGCCCC TGTCTTTTCG CTACCAAGTG TCTGCTTTGG ACTACATTGA 6840 

TAAGGCCTTG TCAGCAGAGG AGTTTGAATC TCGGATCGAG ACAGCCCTCC TCTATGCCAA 6900 

TAGTCAAGAT AGTAAAAGTC TGGCGGAAGA TTGCTTTTAC TTTAAATCAA AATTTGCCCA 6960 

ATTTCAGTAT CCTTTTAAAG AGGTTTACTA TCTCGAAACG TCGCCCAGAG CCCATCGTGT 7020 

TATTCTCTAT ACCAAGACAG ACAGGCTGGA ATTTACAGCG AGTTTAGAGG AGGTTTTCAA 7080 

GCAGGAGCCC CGTCTCTTGC AGTGCCACCG CTCTTTTCTC ATCAATCCTG CAAATGTGGT 7140 
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GCATTTGGAT AAGAAAGAAA AACTGCTTTT CTTT 

(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3207 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

CCACCAGGGA AAATCATTGA AGTTGGTAGT CACCAAGAGT TAATGCAGGC GCAAAGTTTC 60 

TACCATCATC TATTCAATAA ATAAGGAGAA TGTCATGAAT CCTAATCTTT TTAGAAGCGT 120 

CGAGTTTTAT CAGAGACGTT ACCATAACTA TGCGACAGTG TTAATTATAC CTCTTTCATT 180 

ACTATTTACT TTCATCTTGA TTTTCTCCCT TGTTGCCACA AAAGAAATTA CTGTTACTTC 240 

CCAAGGAGAA ATCGCCCCTA CAgTGTCATT GCCTCCATTC AGTCAACCAG TGATAATCCT 300 

ATCCTAGCTA ATCATTTAGT GGCAAATCAA GTAGTTGAAA AAGGGGACTT ACTCATCAAA 360 

TACTCTGAAA CAATGGAAGA AAGTCAGAAA ACTGCCTTAG CAACTCAATT ACAAAGACTT 420 

GAGAAGCAAA AAGAAGGACT TGGAATTTTG AAACAAAGCT TAGAAAAAGC GACTGATCTT 480 

TTTTCTGGCG AGGATGAATT TGGCTACCAT AATACCTTTA TGAATTTTAC TAAACAATCC 540 

CATGATATTG AACTGGGTAT CACAAAGACT AACACCGAAG TTTCAAATCA AGCTAATCTT 600 

TCCAATAGCA GTTCATCAGC TATTGAACAA GAAATTACAA AAGTTCAACA ACAAATTGGA 660 

GAATATCAAG AGTTGAGAGA TGCTATCATA AATAACAGAG CACGCTTACC AACTGGCAAT 720 

CCGCACCAGT CAATTTTGAA TCGTTATCTT GTAGCCTCAC AAGGACAAAC ACAAGGAACT 780 

GCAGAGGAGC CATTTTTATC TCAAATTAAT CAAAGTATTG CAGGTCTTGA ATCATCTATC 840 

GCAAGCCTCA AAATTCAGCA AGCTGGTATC GGAAGTGTAG CAACTTATGA TAACAGTTTA 900 

GCAACCAAAA TTGAAGTACT CCGCACTCAG TTTTTACAGA CAGCCTCACA CCAACAACTA 960 

ACTGTGGAGA ATCAATTAAC AGAATTAAAA GTACAACTAG ATCAAGCCAC ACAGCGTTTG 1020 

GAAAACAATA CCTTAACCTC CCCAAGTAAA GGTATCGTTC ATCTGAACAG CGAATTTGAA 1080 

GGTAAAAATA GAATTCCAAC TGGTACAGAA ATTGCTCAAA TATTCCCTGT CATCACAGAT 1140 

ACAAGAGAAG TACTAATCAC TTACTACGTA TCTTCTGACT ATCTACCTCT ACTAGATAAA 1200 

GGACAAACTG TAAGATTAAA ACTGGAGAAG ATTGGAAATC ACCGCACCAC CATCATCGGC 1260 

CAACTTCAGA CAATTGATCA AACTCCTACC AGAACAGAGC AAGGAAATCT CTTTAAATTA 1320 
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ACCGCTCTTG CAAAACTATC TAACGACGAT ACTAAACTCA TCCAATATGG CTTACAACGT 1380 

CGCGTCACTA CTGTAACTAC AAAGAAAACA TATTTTGATT ATTTCAAAGA TAAAATTTTA 1440 

ACACATTCTG ATTAATTTTC AGATAACACT CTATAACTAT TTATTATCTT ATCAAAAACG 1500 

AGAATCATAA CATGGATAAG AAACAAAACC TAACTTCATT TCAAGAACTA ACAACTACCG 1560 

AACTCAATCA AATTACAGGT GGAGGATTGT GGGAAGATTT ATTATATAAC ATTAATAGAT 1620 

ATGCTCATTA CATCACATAA GAACTTCATC ATCCAATACA ACTATAAAAA AATAAGACCG 1680 

AGAAACAAGT ACTCTCGGTC TTATTTTTCA TCATTCTGTA TCTATCACAG TAAGTACCTG 1740 

ACGAAAGACT TGATTTTGAC AGGTGGTATT TAGACTGGTA TTAGGATGGC TTTCCACAAT 1800 

CTTCATGACG GTATAGAGAC CAACTCCTCT CTCCTCCCCT TTAGAACTGG CTCCAAAGGA 1860

GAAGATTTCA GAAATATCGA TGCCCTCTTC TTTGATGGAG TTTTCGATGA TAAAGGTCTC 1920 

CTGTGCTCCA TTTTTTAAAA AGCCGATTGA AACATGAGGT TGACTAGCTT CCACACTGGC 1980 

TTCAATAGCA TTGTCACAAA GGATAGACAC AATGGTTAGA AAATCAAGTA GACTCATCCC 2040 

CTCGACCTGA ATCTCCTCAG GAACTTCGAC ATTAAAGACA ATGTTCTTAT CTCTGGCTTT 2100 

TAAAAATTTC CCTGCTAGAA GACTTTTGAG GGCTTTATCA CGAATATTTA CCAATCTGCC 2160 

CAGGTCATAT TTATTGTTCT GCAATTTCTG ACTGGAATCC TTTAAGACGG AGCCATAGAC 2220 

CTCTTTTATC TGCTCCATAT CCTCCTCTTC AATGCCCAGA CGTAAGCTAG TCAAGAGGTT 2280 

GGTATAATCA TGACGAAAGC TCCGTACTTC CTTCTAAAGC TCCTCTATAT GCCGACTATA 2340 

GCGTTCCATA TCTCTATAGC GCAGGGCCTG CTCTTGTTCC AATCTCTCAT AGAGTTTTTC 2400 

CTTCAAATAG GTATCCAATT TCTTGATAAC CCCCATAAAA AAGAGTAGGT AAAAGACTAG 2460 

GATGAGATGG CGAACAGTCT TTGATTGAAT ACTTTGTTCA TATTCAAAAA AAGACAGACT 2520 

TTCCATGACT AGATAGTAGC CACCCATTAT CCAGTTAATC TGAGTCAGGG ACTTTTGAAA 2580 

GGCTTTATCG AGAATCTCCT TTCTCAAGCT AGTAAAATCG TAGTCCAACC ATTTCAAAAA 2640 

AGCTAGAGAA ATGAAGAAAT TGAAAATTAT TATACATAAC CCAGTAAATG AGTAGCCATC 2700 

ATATACTTGC CCTTGTCCCA AAAATGGAAG CACAAAATAG GAGACTCCTC TATAAAAGAG 2760 

ATTCACCAAT ATCATTGGAA AGAGACCATA AAAGAAAAGG AGTTTTTTAG GAAGCCCTCT 2820 

CAATAATAAG AAAGATAAGC CTATGCCGTA CAAGGGTTCC ATAAAATAAG ATAGGTAAAC 2880 

ATTTCCTACT ATATAGCTAA TCATCACAAA AACAAAGGCC AACAGTATCT TCAAAAGAAA 2940 

GGCCTTAAAA ATCCTCTCGA AAGTAAGATC AATTCCATCC ACCTTAAAGA AGATGACAAT 3000 

TTCTAGTCCA TTAGTAACAA GTGTATACAA CAATATCCAA GCAATGTTCA TAAATTCTCC 3060 

TAGCTCAGTG TAATTTATTG ATGGCCTCAG ACACTTCCCT GACCTTATAA CGGGCGATTA 3120 
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GACAACTTCC ACCATTGGGA GAGAAGAGCA GTTTTTCTTT CTTATCCAAA TGCACCACAT 3180 
TTGCAGGATT GATGAGAAAA GAGCGGT 3207 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10357 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:

CTGAATCAAG TGTACTGCAC CAGTTCGTGC ATCAGGCATA ACAACATCTA CAGATATAAT 60 

ATTGTTTTCT GAGTCCGCCT CATAAGTTAA AATCATAAAT TTTTCGATAT TCGAATTTTT 120 

AGTAGCTTGT TCAATTTCTT GAATCATTTC ATCAGAAACT AACTCCATCT GAATTGGAAA 180 

GGAATGACTA TTTTCATCAT TTTTCTAGGA AGAATGTTGA TTAAGATAAA GTGTATTCAT 240 

CTGAGCATAT TCAAATAAGT AGCCACTCTT ATTTTTTTGT ACCAAAGGAA ATTGGTTTGT 300 

AAGTCGCTTC TTACCCTTTA TAATTAACAA TACTTTCCCA TATTTTTCTG TATTTGTTTC 360 

AAATTCTAAA TATCCCCAAG TCTGTCCTGC TAATTGTAAT TTATACTCAA ACAAATCTGC 420 

TGATGCAAAT GCAGTATCAA TATGATTAGG TCGCGTCCAT GCATAACCAT TCGACACTAT 480 

CATTGTCTCT CTTTTTTCTA GACGTTCATC TACATAATCT TTTTGCCCTT TCATCAAAGT 540 

ATCTACAATT TTTTGTGCCT CAAGCGAATC AAAGAGATCC TGATTCAACA TAATTCTTCC 600 

TCCTCCAAAT ACTTTTTAAT GAATTATACC ATTTTCTTAA AGAAATTACT ACAATAATTA 660 

TCTTTTTCTT AAAGTTCTGT GTCAGAGTAA TTTAGAAAAT TATATCTTCT ATAGTAAAAT 720 

CAATTAAAAA CTGAACAAAT TTATTGGGAA ATTCAAATCG CTTTCTGAAA ATATTTTAGG 780 

AACCGTAGTG TAATATTCCA GATTCAATTC ACTATAAAAC TGACCTTTCT CCTGCAAAAG 840 

AAAAAGGAAA GACTTCCTTT CGTGCCTTTC CTCTTACTTG CTACTTGTTT GATTATTTTT 900 

GGTAAGCTAC TGCTTGTCTG ATAAAATCCT GAATCGGCTC TCCTTGGTGG AGAGCTTTTA 960 

CTATTTTCGA ACCGACGATA ACACCATCTG ACACCGCATT GAAGCGTTCC AGATCGGCTT 1020 

GACTAGATAC ACCAAAACCT GTCAAGACTG GGATGTCGGC CACTTGATGA AGTTGCGCCA 1080 

AGTGCTTGTC CAAATCTGCA CGGTAATTGC CTGATTTCCC TGTCACTCCA TTGATGGCAA 1140 

CGGCATAGAT GAATCCCTCC GCCCCTTCAA TCAACTCTTT CTGGCGCTCA ATTCCTGTGG 1200 

TCAAGCTTAC TAAAGGAATC AAGGCGATAT CTGTATTTGC CAAAAATGGT TCTACAAAGT 1260 






TGGCATGTTC 


ATGAGGCAGG 


TCTGGGATAA 


TCAAGCCCTT 


CACAGCTGTA 


TCAGCCAGAT 


1320 


CTTTGACAAA 


GTTCTCCACA 


CCGTACTGAA 


AGAGGGGGTT 


GAAGTAGGTC 


ATGATGACCA 


1380 


GTGGAATCTC 


TGTTTCAATG 


GTTTTCAAGG 


TTTCAACTAA 


AGCCTGGGTA 


GAGGTCCCGT 


1440 


GGGCTAAACT 


GCGCAAGCCA 


GCTTCTTCGA 


TAACAGGTCC 


ATCTGCAACA 


GGGTCTGAAA 


1500 



AGGGAATACC 


CACTTCAATT 


GCAGAGACAC 


CCAAATCTTC 


TAAAAAGTGA 



ATTC?P*P - Pf* Af3 


1560


CAAGACCGTC 


CAAACCTTTC 


TCGTGGTCAC 


CAGCCATGAT 


ATAGGGAACA 


AAAATTCCTT 


1620 


TTCCAGCTGC 


TTTAATAGCA 


TTTAATTTTT 


CTGTTAGTGT 


CTTAGGCATG 


AGCTTCTCCC 



1680


TTCTTTGCTG 


CATCTGCTTC 


CAAGCGGTCC 


TTGACTTGAA 


CCACATCCTT 

xp x»p * x» • * *. x* x» pj • 




1740


CCTGATAGGC 


AGACAATCAT 


AGACTTTTCT 


GGTCCAAGTT 


CTTTGGCCAA 



4, 4 4> WlWvWvA 


1800


AAGGCGATAG 


CATGGCTAGA 


TTCCAAGGCT 


GGGATAATCC 


CTTCCACACG 



AGACAAC3AGT 


1860


TGGAATCCTT 


CCAAGGCTTC 


TTCGTCTGTC 



ACAGGGACAT 



AGCTGGCACG 






TGGTAGTGAG 


AATGCTCTGG 


ACCGATACCA 


GGATAGTCCA 







1980


GCTTCAAGAA 


TTTGACCATG 


GGCATCTTGG 


AGCACATCCA 



TGAGGGAACC 



VP » WWAJnvA 




CCTGGACGAC 


CCTTGGTCAA 


GGTAGCTGCG 



TGGTGCTCTG 


TATCCAC*APP 






GCTTCAGTTC 


CATACATAGC 



TACTGACTCA 


TCTTCTACAA 


AfinnATTtnA a 






GCATTCGACC 


CACCACCAAC 


ACAGGCTACT 


AGGGCATCTG 


GCAGATCTCG 





2220


TCACGGTACT 


GTTGTTTAGC 


CTCTCGACCG 



ATGACACTTT 


GGAAGTCACG 







GGAAATGGAT 


GAGGCCCCAA 



GGCAGAACCA 


AGGATATAGT 


GGGTATCGTC 



UninJ a AVA.V. 




ACCCATGAAC 


GAAGGGCTGC 


ATTGACCGCA 


TCCTTGAGCA 



CGCGCGAACC 





2400 


GCCTCGACCT 


TGGCTCCCAA 


AAGCTCCATG 


CGGAAGACAT 


TGAGGGCTTG 





2460


TCTTCCTCAC 


CCATGTAGAT 


GGTACATTCC 


ATGTTAAAGA 


GGGCTGCAGC 




2520


GCCACACCGT 


GCTGACCAGC 


ACCCGTTTCT 


GCCATAATTT 


TCTTTTTACC 



CATGCGTTTG 


2580


GCAAGCCAAA CTTGTCCTAA GGCATTGTTA ATCTTGTGGG CTCCTGTATG GTTAAGGTCT 2640

TCCCGTTTGA GATAAATCTT GGCTCCGCCA ATATGCTGGG TCAAGTTTTT TGCGTAATAA 2700

AGAGGAGTTT CACGTCCTAC GTACTGGCGC AAAAGCTGGT TTAATTCCTC TTGGAAACTT 2760

GGGTCTGCCT GACTTTCACG GTAGGCCTTC TCCAACTCCA AAACTGCTGT CATCAATGTT 2820

TCTGGGACAA AACGTCCGCC GAATTTTCCG TAAAATCCAT CTTTATTTGG TTCCTCATAT 2880

GCCATGCTTT ACCCTCTCTA TAAATCTTCT AATCTTTTCA TGATCTTTTT GTCCATCTGT 2940

CTCCACTCCG CTCGATACAT CTACTGCATA GGGAGTAAAG TGTTGAATTG CTTTTACTAC 3000

ATTATCTTCA TTAAGGCCAC CTGCGATAAA GAAGGGCTGT GCTAGTCCAG TCGTATCCAG 3060

TTGACCCCAA TCAAAGGGCT GGCCACTTCC TGCCACACGG GCATCAAAGA GTACATAATC 3120

TGCCTGAGAA TTCGGGACAT GCCCATTTCC ATCTACCTGC ACAGCCTGAA TACTGGCACA 3180 
AGGCAAATTC TCAAATAAAT CATCTGCCAC CTGACCGTGA ACTTGAACCA AGTCCAAGCC 3240

AACTTTGTCA ATCGCTTCCA GCAGTTCTAC CCGACTTGGT GAAACAAATA CTCCAACCTT 3300 

TTTCACATCT GCAGGAATAA GCTTTGCCAA CTCAGCTGCC TCTTCTAAAG TCACCTGTCT 3360 

TTTACTAGGT GCAAAGACAA AACCGATATA GTCGGCTCCT GCTGAAACGG CTGTTTCCAC 3420 

CGCTTCTTTG GTCGATAGTC CACAAATTTT AACCTTTGTC AATCTGCAAC TCCTTGATTC 3480 

TCTGGGCCAC ATTTTCTGCC TGCATAAGAG CTGTCCCTAC CAAAATTCCG TTAAAGTATG 3540 

GGGCTAGTCG TTCCGCATCC TGCCCTGTGA AAATGGCAGA TTCAGAAATG TAATAGCGAC 3600 

CTTCCTCAAA GTAAGGGGCT AAATCTACAC TGGTCTGCAA GTCGACCTCA AAGGTAGTCA 3660 

AGTTGCGGTT CTTGACCCCG ATAATCTCAG CACCAAGTCT GTGGGCTACC TCTAGTTCAG 3720 

CTAGATTGTG AGTCTCCACT AAGACTTCCA GACCAAGCTC TGTCGCGTAG TCATACAGTT 3780 

CCTTGAGGCG TTCTTCGGAC AAGGCTGCCA CAATGAGCAA GATAACTGTC GCACCTGCAT 3840 

TGCGAGCGCG GATGATTTGC TTTTCATCGA TGATAAAGTC TTTGTTGAGC GTCGGAATCT 3900 

CTACCTGACT GGAAATTTCC CGTAGATAAT CCAAATGCCC TTTAAAGAAA ACCTCATCTG 3960 

TCAACACCGA AATCATCACT GCTCCGTTTT CTTCATAAGT CTGGGCCTGT TGCACAATAT 4020

CCACATCGAG ATTGATATCT CCCAAACTAG GGCTAGCTTT CTTGACCTCA GCGATTACCT 4080 

GCAAGCGGTC CTGATGATTC TTCAAAAATT CTGCCAAGCG ATAGGTCTGG CGCAGAGGCT 4140 

GGATTTGCTC CAGCTTCATC TGCTCCACCT CACGCGCCTT CTGCTCTAAG ATTCGTGCTA 4200 

AAAATTCCTG ACTCATTTTT GGTACTCCTG TAACAGTCTG AGTTTTTCAA GGGCCTTGCC 4260 

TCTAGCAATC ACTTGACGGG CCAAGGCAAC CCCTTCCTTG ATGCTATCAA TCTTACCATT 4320

AGCATAGAAA CCAAGACCAG CATTCAAGAC TGTCGTTTCC AAGAATGGAC TTGCTTCGTT 4380 

TTTCAGAACG CTAAGCAAAA TTTCTGCATT TTCCTGAGCA TTCCCACCAC GAATATCTTC 4440 

CATAGCATAG CCTTCCATTC CCAAATCCTC TGGAGTAAAG CTTGACAAGC TGATTTCGCC 4500 

ATTTTCAAGA AGTGCAATCT TGGTTCTTCC GTTCAAGCCA GCTTCATCCA ACCCTTCTGG 4560 

TCCAGCAACC ACGATGGCAC GTTTGCGACC CATATTTTTC AAAACCTGAG CTGTACTTTC 4620 

TAGGAGTTCT GGACGACTAA TTCCAAGAAG CTGTGTTTCT AAAGCCATTG GATGAATCAG 4680 

TGGACCAGTC AAGTTCATAA TCGTTGGAAT TCCCAATTCC AAACGAGCTG GCATGATGTA 4740 

TTTCATAGCT GGGTGCATAT TTTTAGCGAA GAGAAAGACG ATTCCAGTTT TATCAAAGAC 4800

CTTACCTAGT TCAGCTGCTT TGAGGTCAAG ATTGATTCCC AAGGCTTCCA GGACATCTGC 4860

GGAACCAGAT TTAGAAGATA TCGAGCGGTT ACCGTGTTTG GCCATGTGAA TACCGCCACC 4920

AGCCAAGACA AAGGCTGCAG TTGTGGAAAT ATTAAAACTG AAAGACTTGT CCCCACCTGT 4980 

ACCACAGTTG TCCATGGCAT CATGAATCTC AGTTGGAATA TGCTGGGCAT GTCCTCTCAT 5040 

GACTTGGGCA ATGGCTGTGC GTTCTTCAGG TGTTTCCCCC TTCATCTTAA GAGCTAAGAG 5100 

GAGAGAAGCA ATCTGCGCTT CAGTTACACG CCCAGTTACG ATACGCTCAA TGACATCCGT 5160 

CATTTCCACA CCTGATAAAT TTTCAAATTT TGCTAGTTTT TCAATAATCT CTTTCATCCT 5220 

AGTTTCCTCA CTTTACAACC TCCTCGATAA AATTCCGAAT AGAAGACAAG CCGTCTGGCG 5280 

TTCCAATGCT CTCTGGATGG TACTGGAAGC CATAAATCGG TAGGTTTTTA TGTTGAATCC 5340 

CCATGATGGC TTGGTCATCA GTCGAACGAG CTGTCACTTC AAAGTCTTCT GGCATTTCCT 5400 

CAATCAAAAT ACTGTGATAA CGCATGACCG CACGGCCATC CTCAATACCT TGATACAAAA 5460 

CAGATGGCGC TTCAAAGTTG ATATTGCTCT GTTTCCCATG CATGACTTTT GGAGCCAAAC 5520 

CTAGCTTACC ACCAAAGACT TCTGCAATGG CTTGGTGGCC CAAACAAATC CCAAGAATCG 5580 

GCTTCTTGCC TGCAAAATCA CGAATCATGT CTTCCATCTT TCCAGCATCA ACTGGCCAAC 5640 

CAGGACCAGG AGAAAAGACC AGACCATCTG CTTTTTCAGC TTCTTCATAC AGCTTGGAAT 5700 

CATCATTTCT CAGAACCTGA ACTTCTGCAA AATTCCCAAT GTATTGGGCC AAGTTATAGG 5760 

TAAAAGAATC ATAGTTGTCA ATCAATAAAA TCATGGTCTT AGTTCTCCAA TTCTAGTCAT 5820 

AGATTTTGCT TTGTTAATGG TTTCTTGGTA TTCGTTTTGG GCGATAGAGT CGTAGACAAT 5880 

CCCTGCCCCA GCCTGCACAT AGGCTCTTTG ATTTTTGAGA ATCATGGTTC GGATGGCGAT 5940 

GGCCAAATCC ATATCACCCG TCGCAGACAA GTAGCCGATT GCCCCAGCGT ATACTCCCCG 6000 

TTTTTCCGTT TCCAGTTCAT AGATACGTCT CATCGCTCGA ATCTTTGGTG CTCCAGAAAC 6060 

GGTTCCAGCA GGAAGCGTTG CTTTCAAGGC ATCCATGGCA GTGAGTTCTG GAAGCAAACG 6120 

CCCCTTGACT ACGCTGGTCA AATGCATGAC GTAGCGGAAG AGCTCCACTT CCATATACTT 6180 

AGTGACTTGG ACACTGGTCG TTTCAGAGAT GCGGCCAATA TCGTTACGCC CCAAGTCTAC 6240 

CAACATTCGA TGTTCTGCTG TTTCCTTCTC ATCAGAGAGG AGGTCAGTCG CCAAGGCCTT 6300 

GTCTTCTTCA TCCGTAGCCC CTCTTGGTCG CGTCCCTGCA ATCGGATTGG TTGTCACGAT 6360 

GCCATTTTTG ACAGAAACCA AACTTTCTGG ACTAGCTCCG ATGATTTGAT AATCCCCAAA 6420 

ATCATAGAAA TAAAGGTAAT TAGAAGGATT AGTCACGCGG AGATTTCTGT AGAAGTCAAA 6480 

TGGATTTCCA GTAACTTCTG CTGAAAAACG CTGGCTGAGT ACACATTGGA ACATATCTCC 6540 

GTTACGAATC AAGTCACGAG CTGTTTCTAC CATTCCCTCA AACTTATGTG GAGCGATATG 6600

CGGTTTGAAG TCTAACGGAG ATAGATCCAA ATCTTCAAAT TCATTTGGAG CAGGAATGCG 6660

TAATTCCTCA AGCACTTGGT TCAAGGATTT TTCCAAGGCC TCTTGACTGC GCTCACTATA 6720

AAGTGCATCC TCTATGACAT GTATCTTCTC CTTCTTGTGG TCAAAGACCA TATAGCTCTC 6780

ATAGACAAAG AAATGCATGT CTGGCGTCCC AATTGTATCC TCAGGGATTT GACCAATTTC 6840

TTCATAAAGC GAAATCATAT CGTAACCCAC AAAACCAATG GCTCCACCAC CAAAAGGTAG 6900

CTCTGAGTGG TGCTGACTCT TATGAATCAC TTCATAAAGG AAATCCAAGG GATCCCGATC 6960

AATCACTTGA CCATTTTGAT AGAGAACCCC ATTTTCAAAC TTAATCTCAA AAACTGGATT 7020

ATAGGCTAGG ATAGAAAAAC GAGCTGTTTC CTTGTCTCTC GGAATACTCT CTAAAATAAC 7080

CTTATGTTGC CCCTTTAAGC GCATATAAGC CAAGATTGGT GATAAGACAT CTCCATGAAT 7140

GATTCGTTCC ATTGTAATTT CCCTTTCAGT TCTACTTCTA GTCCGTGGTG ACTGTATGAA 7200

AAATCCCCAC GCAAAATAAC TTGCGTGAGG ACGAAATTCG CGGTGCCACC TCAATTATAG 7260

GATTTCTCCT ATCTCTCATT CCTGTCTCAG ATATCTCCTG TAACAGGCTG TGCGATAAAG 7320

GGCACTCCCT TGAGAATGAT GTTTTCTTCT CTCGTTTCAG ATGAACCCAA CTTTACAGCT 7380

TTCTCTGCTT GTTTTCAGCA ACCACAAGCT CTCTGTGAGA GAAAGAACTG TAATTTTTCC 7440

ATCTATTATT TTTTAGCTTC TAGTAGTCTG CAATCGCAGC TAGGTCCTTG CCTCCACGAC 7500

CAGAGACATT GATGAAGAGA TGTTCATCTC GGTACACCTT TATACTCTTC GAAAATCTCT 7560

TCAAACCGCG TCAACGTCGC CTTGCCGTAG GTATGGTTAC TGACTTCGTC AGTTCTATCT 7620

GCAACCTCAA AACAGTGTTT TGAGCTGACT TCGTCAGTTC TATCCACAAC CTCAAAACAG 7680

TGTTTTGAGC TGACTTCGTC AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGACT 7740

TCGTCACTTC TATCCACAAC CTCAAAACAG TGTTTTGAGC AGCCTGCGGC TAGTTTCCTA 7800

GTTTGCTCTT TGATTTTCAT TGAGTATTAC TAGCTTTTTT CGTATTAGTC CAGCCTTTTT 7860

GTTTGCTTTT AGTAGTAGGC ATGGAGCTGT AGATAGAACT CAAGTTCATC AAAGCGACTT 7920

AAGGCCCTAA TAAAAGATAA ACCAAACGAC GGATAGAAAA AAGCCCACAC ACAGAATATA 7980

CTTCCGTGTG AGGGCGTTGG TAACGCGGTG CCACCTCAAT TATAAAGGGA CTATCCCTTT 8040

ACATCTCTGC ClWriTAAC AACAAGCTGC ACTGTAAGGT GTGCGCACCG AATTTTCATT 8100

GTTTCAAATT CATTTTCAAA ATCAGCCCAC TTTCACTACT TCCAACCACC TATTCACAAT 8160

CACCACAGGC TCCCTGAAGA TCAAAAATAG TTACTTTTCT GATTTGTTGA ACTTATTTTA 8220

ATACTTTGTT TTTTCTTTGT CAAGACTTTT TTACGATTTT TTTGAAAATA TCATTCGAAT 8280

ATCACCATGT CTTCCTTAGA TCGAACATGA ACATGTCCCA CTTCTTAGAA ATTGGATCCA 8340

ACTCAATAGA AACTGAATGG AGGCTAAACA GAACTTATTT TAGAACACTC CATCTTTTCC 8400
ACTAGGATTT TCAAGAATTA AACAATACTA GAAACTCTGT CTCCTAACAA ATTTAGGAGA 8460

AACTTCAACA GATGTGACAC TTTCCCCTTT AATAATTGCT AAAACACCTT [illegible] 8520

TTTAGCCAAT TTAACATAAT TGGGAGCAAT TGTAGACAAA GCTGGAGTAT [illegible] 8580

AATAGGAATA TTATCAAATC CAATGATAGA AATATCATCT GGAATAAGAA [illegible] 8640

ATAGCACGCA CGAATCAAGC CCTGAACCTT TTCATCTCCT GAAACAAAAA [illegible] 8700

ATAATTTTGG GTAGTCAAGT GCTGCATTGC ATAAGAATAA ACTGAATCAA [illegible] 8760

GCCATAAATG ACTTTTAAAT CCATAAAGTA ATTTTTATCA TTCAGAAAAG [illegible] 8820

TCTTTCACGA TCCTTATTAA CATGGGATTC TCCTCCCATA AGCAACCACA [illegible] 8880

TTTTTCTTCA GTTACAGCTT TCATCATATC ATAAGTAGCT TGAAAATTAT [illegible] 8940

ATAGACTACT CCAGACGTTT GAGATTCACC GAAAACAAGA AAAfiGTATAT [illegible] 9000

TAAATACTGA ATTCTGATAT CATCTACACT TTCATAAAAA [illegible] LATtTACTAG 9060

GCTACCTGTG CTTGATATAA TTGAATTACT AATTGTATCT [illegible] [illegible] 9120

TATAGCATTA ACACCAAATT CTTTACACGT CCGTAACACT TTATTTAArA [illegible] 9180

CCAAATTAAA GGAAAAGAGT CGATTTTTTT TACAGAAATC [illegible] [illegible] 9240

TTTAGTTAAA TTTTTTGCAT ACGCATTTGG AATATACGAC [illegible] [illegible] 9300

AATCGCTTGA TAAGTTTCTT CTTTAACATT TACTCCACCA [illegible] [illegible] 9360

TTTTGGAGAA AAACCTGATA AACGTGCAAT ATPATAAATA [illegible] [illegible] 9420

ATTTTTCATT TCAGTCCTCC ATTACGAACA TTCTAATATT APTATAPAAT [illegible] 9480

TTTTAACAAG AGAATTTAGT AAATTATTTA AGATCCACAA ATTPAPAAAA [illegible] 9540

AAATATTCTT CCCCTTCAAA AAAGTTTAAA TTGCATTTCA CACCTTTATT TTTAAGAATG 9600

TTTCCAACTT CACGACAAAT AAATTCATAT GAGAAAAAAC TGCCATAAAA TTGTAGATTA 9660

ACTTTTTCAG TAAAATGTGT AGGATTTATA AAAACATATA ATAGCCTGTC AATGTAACAT 9720

TTTAACATAG AGTTAATTTT TTCTTTAAAG ATAACATTTG TTATCAACTC ATCAGGAGGT 9780

AAATGAAAGG CAAACACCAT TTCACAAATA TCATAAAAAG AAATAAATTT GTATACTTGT 9840

ATCAAACAAT TATTATCAAA ATATTCTATT TTACCTAAAT CAAAATTGAT TTTATAATCT 9900

TTCATAAAAA CCTCTGAGCA AAAATCTACT CAAAAATTAG ATGATTAAAA CATCTAAAAA 9960

GCAAAAGGAC AAAAACATCT GTCCCTTTGT TTACTAAATT TCAGCTAATT TCTTCGACAT 10020

AAATAACACC TACAATATTA GCAATTTCTT CCATCAGTCG AAGATGTTCA AATCTACCTG 10080

ATAATTCCAG AGTAATAAAT GACGCTATTT TTTTGTCCGG AACATCAAAG TATTCAATTC 10140

TGTCAGAATT AACATCTCCA AACGCTGTTC TTGAATCGGT CATTCTGATA CCATTTTCTG 10200

CACAATAAAC CAATACACGA TTATAGGCTT CTGTAGATTT AACCACTATA TACAATTCAA 10260 

TCATTTTAGA ACGATTTTGC AGATATTTTT TTAGTGGTTG GAACATGGAT ATCACACCCC 10320 

AAACAGAAAT GGCTACTAAA AGAGCTCCCT CATAAGG 10357 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

CGGGACATTC TCAATCTTCT GTCTTTTGTT TTTCTCTTCT TTCTATGATA CAATGGAAAA 60 

AATAAATTCA AAAGGAGTTT TTTTATGACT TATCCAAATC TCTTGGACCG CTTCTTAACC 120 

TATGTTAAGG TCAACACGCG CTCTGATGAA CACTCTACTA CTACTCCAAG TACACAGAGT 180

CAGGTTGACT TCGCAACAAA TGTCCTAATT CCTGAAATGA AACGTGTTGG ACTGCAAAAT 240 

GTTTACTATC TACCGAATGG TTTTGCTATT GGAACCTTGC CAGCCAACGA TCCGTCTTTA 300 

ACACGTAAGA TTGGTTTTAT ATCGCACATG GATACTGCTG ATTTTAATGC TGAAGGAGTC 360 

AATCCACAGG TAATTGAAAA CTACGATGGT GGTGTGATTG AACTAGGGAA TTCTGGTTTC 420 

AAACTCGATC CAGCTGACTT CAAGAGTCTT GAAAAATATC CAGGACAAAC GCTCATCACA 480 

ACAGATGGAA CAACCTTGCT AGGTGCTGAT GACAAGTCAG GAATTGCTGA AATTATGACA 540 

GCCATTGAAT ATCTAACTGC TCATCCTGAA ATTAAGCACT GTGAGATTCG TGTTGGTTTT 600

GGTCCAGATG AAGAAATCGG TGTTGGTGCC AATAAATTTG ATGCAGAAGA TTTTGATGTG 660 

GATTTTGCCT ACACTGTTGA TGGTGGTCCA CTAGGTGAAC TTCAGTACGA GACTTTCTCA 720 

GCCGCTGGTG CTGAATTGCA TTTCCAAGGT CGTAATGTCC ACCCTGGTAC TGCCAAAGGG 780 

CAGATGGTCA ATGCCCTTCA GCTAGCAATT GATTTTCATA ATCAACTTCC AGAAAATGAC 840 

CGACCTGAGT TAACTGAAGG TTACCAAGGT TTTTACCATC TAATGGATGT GACAGGTAGT 900 

GTTGAGGAGG CGCGTGCAAG CTACATCATT CGTGATTTTG AAAAAGATGC CTTTGAAGCG 960 

CGTAAAGCAT CCATGCAATC TATCGCTGAT AAGATGAATG AAGAACTTGG GAGCGACCGT 1020 

GTCACTCTCA ACTTGACAGA CCAGTACTAC AATATGAAAG AAGTCATTGA AAAAGATATG 1080 

ACTCCAATTA CCATTGCTAA AGCCGTTATG GAAGATCTAG GTATCACGCC TATTATCGAA 1140
CCAATCCGGG GTGGAACAGA CGGCTCTAAG ATTTCCTTTA TGGGAATCCC AACTCCGAAT
ATCTTTGCAG GTGGCGAAAA TATGCACGGA CGTTTTGAAT ACGTTAGCCT TCAGACTATG 
GAACGTGCAG TTGATACCAT CATTGGCATC GTAGCTTATA AAGGCTAAAA AGACGAGGTA 
GCTCAGCTAC TTCGCCTTTC TTTTTATTCT ACTGCTTTTT CTTGATTTCC AGTAGTTGTA
GAAGATTCTG TTGTTTCATT TTCTGAAGTT GATTCAGCAG GTTTAGAATC TCTTGTATTG
CTTGGTTTGT TTTCGTCGCT AGCAGTTTCA ATGTTAGATT CTGCAGTTGC GTTTGGTTGC 
TTCTCAGCAC TGGTGTTATC ACCATTTGCT TCAGCATTTC TTGCTGGACT TGTTTCTTCA 
CTTGCGCTAG CTTTTGACTG GATTTGATGA TTCAAAACTA GAATAGCTTT TGTCGATTCA 
AGTAAAGCTG TTTTGTCTTT ACTCTTAGCA GAAAGTTGAT CTAATAATGC ATCCACCTTA 
TCAAAGTCCG CATCAGATCC ATTATTACTT TCTAAATAAG AGTGAAGCGA CATGAGAATA 
TCGTAGAGTT TTTGATAGAG TACAAGTGTC TGAGGATCTT GCTCAGCATT TTCCTTTTCT 
TGTTGAAGGG CGCTAGCGAT ACGAGTCAAG ACATCTTTTA CCTGACTGTT TACTTCATCC 
AAGTCTGCAT CAGCCTTGTT TGTGGCAGCT TTTAGATTTT CTACTTCTTC TGCCAAGGAT 
TGTCTGATTC CTTCTTCATG GATTTGTTCC AACAGTTGAT TTGCCTTGCT CAAAAGACTT 
TCTACTTCTT CCTTGCTATC TGTCGCAGAT TATTGGTTGC TATCTACCAT GTACTCCTAA 
AACAGGAGAG TTATAATCCA AGATTACAAG GCCTTACAGA AATAAGAAAT CCAGATAAGA 
CAATGTTCGT CCAAGACGCT ATTCGCTTCG CACAGCAGCA CGGATTCAAT ATGCTTTAAT 
TTTAAAGTTT AGGTGTCAAG ACCTCTTTTT AGTGTGCCCA AAATTTAGAG AAGTAATCAA 
TCAACTAACT TTTATTTTTT TCAAACTTTC AGTAAACTGA CCTAAAGCTA ACTCAATCTG 
TCTTTGTAGA TGCTTCTGCT ATCAGCTAGA AGTTGATCTA CTTTTGCCAA GACTGCCTTC 
TCATCAAAAG TTCCAGGTTG ATAGTTGGAT TGCAGGGATG GAATCTTGTT TTTCAAAGCC 
GCTTCATATC CCTTAGTTTG AACCTTGATG TAGTGATTGT GGTCGCCATG AGGAATCACA 
AAACCTTCTG AATCTTCACT TATAATTCGA TTGGCATCAA AACCATGACC ATCTTCTTCC 
TCATGATGGA CATGTAGTGA CGGATTACTT AATACAGAAC TAGAAGAACT TCCTACCTCT 
TCCGTGTTAG AGTGTGATGG GGGATTGTTA AGAGATGACT TAGGAATATA GTGATAGTGA 
TCCCCATGTC TTACTATATA AGCATCACCT GTATCTCTGA CAATATCATT AGGGTTAAAG 
ACATATGTGG CTGCTAATTC ACCTGCCGAC AAGTCACTCT CAGGAATGAA ATGATAGTGA 
CCACCATGTG GTACTATAGT AGATTGAAAT AGAATATGAG CAAATTGATA AGGGGATTTT 
AAAGTAATTT CTAACAATGA TTTAGAAACT ATGATGTGCT ATTCTAAATT CAACTCACTA 
TATATAACCA TCATCGGTAG TATAACGTCC CTGTAATTTT GCTACAGATA CTTCTGCACT
AGCTCCTTTA TCGTCTTTAC CATGTTCTTG TTTTTGGCGA TTGATTTCAT CTTTTGTTCG 3000
TACATTTTCT GCATGAGCTT GATCTTTAAG GTAAACATAA TACTTTCCAT CTACCTTAAT 3060
AATATATCCT CCCTTAACCT AACTGACGAT ATCTTGATCT TTCGGCTGAT AGTTGGGGGC 3120
TTTCATTAAT AGCTCTTCAC TAAAGAGCGC ATCAAAAGGA ACTTTACCAT TATAGTAGTG 3180
ATAATGATCG CCATGAGAAG TTACATAACC TTGATCTGTA ATCTTAATAA CAATTTGTTT 3240
TGCTTGAATT CCTTCTTTTT GACTAACCTA GTCTGGAGTC AAATTTTCAG TCTTCTTAGT 3300
GTCTTTATTA CTGTTTACAT ATGAAACACG ATTTTTATCT GTATTGGCCT GTTAGCTATG 3360
TTGGTTCAGA GCATAAACAC ACAGACTTAA GGAAAGGATA ACAACAGATC CAGCTGCTAT 3420
ATATTTCTTT TTAAATTTCA TAATTACCTC ATTTCTATAA TTATTTATAT GATGTCTTCA 3480
TTATTAAATG ATTAAATAAA TTAATTAACC AATTAATTAA CTAGTAAATA TTCCACCTCT 3540
TTTTAAGTTG TATGTCAAGA AATTTTATAT ATTAATAATA AAATGAAATT CTCCCAAAGT 3600
CAGAGTTTTA TTTCTAACTT TTGAGAGAAC TTCATTTTTG ATTCAGACTT TTTCTACTGC 3660
TATTCCTTAC GCTATGAGAT CAGATAAATT CnTTTTATC ACTTCTCCAC TTGGCAATCT 3720
TAATTCAATC GTTCCATCCA TATTGAATAT AACACTATCT AAGCCTAATC CGTAACTAGC 3780
TGTAAATTTT TCTAATTTTT CTTGTACAGG ATCTACTGCT GGAGCTTCCT CTAATGCTGG 3840
ATCTAACATA GGGTCACTCC CCACATTCCC TTCTGGATTC AACATTCCAT TATCCGTTGA 3900
GTTTTCTGGT TTTACAGGTT TTTCGTTTGG TGCCTCTGGT AAAGAATCTG CTGGTTTATT 3960
TTCTGTTGGT TGGTTCTCAA CTGTTCCAGT AGATACTTTT CCATTTTCAG ATGGTTTATT 4020
TTCACCATTT CCTTGAGGTG CTTCTCCTGT AAAATCTGCC ATATTCTTTT TAATGACTTC 4080
TCCCGATGGT AAATATAATT CAATTGTTCC GTCCATATTA AACAAGACAT TTTCTAGCTT 4140
CATCCCATAA CTTTCAGCAA ATTTTGCTAC TTTTTCTTGT ACAGGATCCA CTGTAGGAAC 4200
TTCTTCTAAC GTTGAATTAC TAGTACTATT CCCAGTTTCA GAAAGTTTTT CTTTTTCTAC 4260
CTTCTCACTA GTCTTTGGTT CTTCTACCTT TTCATCAAGT TTTAAGTTTT CTTGTGCTTT 4320
ATTCCTTTTA AATTGTGGTA GAATACTTGG TTTATCAGTT TGATTTTCTT TTTCCAAGAT 4380
AGGTACTTCC ACAATATAAG TCGATTGATT GTCCAAATAA GCATTPGCCA TGAAGGTTAC 4440
AGGAATTTTA TTTCCGGCCG TTCTGGTTGT TCCTTGGTTT AATTTCGGAA TCGGTAATTT 4500
GATTTCACCA ACTTTATAGT TATTTTCTAA ATAAGCATTT CCATGAAATT CATCAAACAC 4560
TCTGACTAAA GCATCAGTTC CTTTAGGCAC TGCAAATTGA GGGTTCACTC TTAAATAAGT 4620
ATCCCCTGCA TGGAAAGGAT AGAAAATCGT TTGACTGGCC ATTTTGTAAG CTAAAGAGGT 4680
TGGAACTGTA AATGTACCAT CATAACTTAC TTCTGGATAA TCTTTTGAAG CGATAGTATA
CTTAAATGTT TGTCCTGGTA AATAAGGTTG ATCTAATTCA AAGTTTGCAA TATTCCCTAC 
TCCTTCTCCA AATACTTTAC CAGATACTTT CTCCAATACT TTTCCATCTG GTGTTATTAA 
TTTTACTAGC ATATTGATAC CTAATTTTTT CTCCAATTCA GGCCGAAAAC TAAAAGAAAC 
GCGTTTTTGA CCATTGGCTA GAGTAAAGTT TTGATTATTA AACGTACTAT TTTTTAACAA 
ATTAACAACA TTCGTTAATT CTTCTCCAGT ATAAACTTTA TTCCCTTCTT TTTTAGCAAC 
TCCTTCTTCG GGTTTAAACA GTTCATAGTT ACTGTGAGAA TGACCAATTC CAACCGGTTT 
ATGTTCATCA ATCGGATCTG CATGATGGTG ATCTCCATGC GGATAAATAA TCGCATTTTT 
TTCTTTATTC ACGACAATAC TTTCACGTTT GACACCATAT TGTTTCATAA TGCCAGCAAT
TTTTTCTTCG ATTTTTTTAT CTAAATCTTT CATTTCTTTG GCATTACTTG GATAATCCTG 
TTCATGAGAT GACAAAGAAT CTAATCCATT ATGACTAGTT TTAACTTCCT CTAAATGTTT 
TTGCGCAsCT TAATTTGCTC TTCTGTCAAG TCCTTCTTGA AGAAATAATG ATTGTGGTCT 
CCGTGACTCA TGACAAAACC TGATTCATCT TCAGCGATAA TACGATTAGC ATCAAATCCG 
TATCCATCTT CTPCATGTTT CTCATGTGAA GTTCCTGGAT TGATTGGAAG AGATGGAGAA 
GGTGTTGCTA GACTATTGTT TCGAAGAGTC GGTTCCCCAA TTTGATTTGA TTTTGGAATG 
TAATGGAAAT GATCACCATG TCTTACAATA TAAGCTGTAG CCGTTTCTTC AACGATATCT 
TTTGGATTAA AAATATAACC ATCAGATGCT GAAGAGAGCT CCTTACTTGT CGTTAAAGAA 
GAAGGATTGC TTGAAAGACT GCCTAGACTA GACACTACTT CATTAGGTTT TGCATTTGTA 
GAAACTGTAG AACCAGTTCC ACTGATAGGC ACCATTCTGG CAATCTTTTC TTCTAAGGCA 
GAAAGCTTGC TGTAAGGAAT AAAGTGGTAA TGGTCGCCAT GCGGAATCGC AACTCCATTT 
GGTGTACGAC TGATAATCTT AGCAGGGTCA AAGACCAGGC CATCTGATTC ACTGTAACGT 
TGGGCGCTAG GTGAATCATA GAGTTCCTTC AAAAGACTCT GGAGATTTTC AGATTTATTT 
GCTGGCTTGC TAGTTGATCC TTTTGCTACA GATTGCGTGT TATTGTCACT AGCTGTTGAA 
GAATAGCTTA ACTGACTCGG TTGCATATTT TTTCCAGCCA GATGTGCTTT AGCTGCTGCT 
AATTCACTAG CAGATAAATC GCTTTTGGGA ATGTAGTGAT AGTGACCTCC ATGAGGAACG 
ATATAAGCAT TACCCGTATC TTCGATAATA TCAGCTGGAT TAAAGACATA ACCATCATTT 
GTCGTATATC GTCCCTGAGA CCTTGCTACA GCAACATTAG AGTTAACCTT CTCATTATCT 
TTGACATGTT CTTGTTTTTG ACGATTGATT TCATCTTTAG TTCGAACATT ATCAGCATGA 
GCTGCATCTT TCAGGTAGAC ATAATATTTT CCATCGACCT TGATGATATA ACCACCCTTG 
ACTTCATTGA CAATATCAGC GTCTTTAAGT TGATAGTTTG GATCCTTCAT CAAGAGTTCT
TCACTAAAGA GGGCATCATA AGGAACTTTC CCATTATAGT AATGATAGTG GTCACCGTGT 6540

CACGTTACAT ACCCCTGATC TGTAATTTTG ATTACAATTT CCTCAGCCTG AATTCCTTCT 6600 

TTCTGGCTAA CCTGCTCTGG TGTCAAGTTT TCACTTTTCT GACTTGACTG GCTGCCATCC 6660 

ACATAAGAGA CACGATTATT GTCCTTATTT TCCTGCGAAC GATGCTGGTT TAGTGCATAG 6720 

GCACATAGAC TCAAGGATAC GATAACAGCT GATCCACCTG CTATATATTT TTTACTAAAT 6780 

TTCATAAATC CCTCATTTCA ATAAATGATG AAGTTTTTTC TCAACTTCTT TTACTTTATT 6840 
AAATAGTTTT CTAAACCCGG GGGTACC 6867
(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 999 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193:

CGTTCTAAAA ATGCAGTACG TTTGATTGAG AAATCAGTTA AAGGTATGCT TCCACACAAT 60 

ACACTTGGAC GCGCTCAAGG TATGAAGTTG AAAGTATTTG TTGGAGCTGA GCACACTCAC 120 

GCTGCACAAC AACCAGAAGT TCTTGACATT TCAGGACTTA TCTAAGGAAA GGAACAATAA 180 

AGTATGTCAC AAGCACAATA TGCAGCTACT GGACGTCGTA AAAACGCTGT TGCACGCCTT 240 

CGCCTTGTTC CAGGAACTGG TAAAATCACT GTTAACAAAA AAGATGTTGA AGAGTACATC 300 

CCACACGCTG ACCTTCGTCT TGTCATCAAC CAACCATTCG CAGTTACTTC AACTGTACCT 360 

TCATACGACG TTTTCGTTAA CGTTATAGGT GGTGGATACG CTGGTCAATC AGGAGCTATC 420 

CGTCACGGTA TCGCTCGTGC CCTTCTTCAA GTAGACCCAG ACTTCCGCGA TTCATTGAAA 480 

CGCGCAGGAC TTCTTACACG TGACTCACCT AAACTTGAAC GTAAGAAACC AGGTCTTAAG 540 

AAAGCTCGTA AAGCATCACA ATTTAGTAAA CGTTAATTCG AAAGAATTAC TATACTTATA 600 

CAGAGCACCT TTCGGGGTGT TCTTTTTTTA TACTTTCTTA CTAAATTGGT GCAATTGACA 660 

CAGTTGTTGC GACTTTAGTC GCTTACAAAT GTGGCTGCAA CCTGACATGG TCAGTTGCCT 720 

CAAAACGTTA ATCAATACGA TTATATCAAC GTTTCAAAGC ACTCAAGGGT TTACCCTATG 780 

GGTGCTTTTT TCTATACTTT CTAAAAAAGT TTACCCTAAA ATTTGCCCTA AAATTACCCT 840 

ACTTATTTTT AAGATGTTGG TAGGCAACTT GTCCAGCAGA TAATGGAACT ATGTTTGAAG 900 

TATTAACATA AGTCTTAGTT GTAACCGTAT CGCTATGAGT TAATGCTTCA GAAATGGCTT 960

CTAAGCTCAT TCCTGCTTTT TTAGCAAGTG TCGCTCCTG 999

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2315 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AATATTATCA CTGTTCTTGA AGGCAGAACA CAAGCTGTCA TCCGAAATCA CTTTCTTCGC 60 

TACGATAGAG CCGTTCGTTG TCAAGTGAAA ATCATTACGA TGGATATGTT TAGTCCTTAC 120 

TATGACTTGG CTAAACAGCT TTTTCCGTGT GCTAAAATCG TTCTACATCG TTTCCATATT 180 

ATCCAACATC TCAGCCGTGC CATGAGTCGT TTTCGTGTTC AAATTATGAA TCAGTTTGAA 240 

CGAAAATCTC ATGAATACAA GGCTATCAAG CGTTACTGGA AACTCATCCA ACAGGATAGT 300 

CGTAAACTCA GCGATAAACG TTTTTATCGC CCTACTTTTC GCATGCACTT AACAAATAAA 360 

GAAATTCTTG ACAAGATTTT AAGCTATTCA GAAGACTTGA AACACCACTA TCAGATCTAT 420 

CAACTCTTAC TTTTTCACTT TCAGAACAAA GACCCTGAGA AATTTTTCGG ACTCATTGAG 480 

GACAATCTGA AGCAGGTTCA TCCTCTTTTT CAGACTGTCT TTAAAACCTT TCTAAAGAAC 540 

AAAGAGAAAA TCGTCAACGC CCTTCAACTA CCCTATTCAA ACGCCAAATT GGAAGCGACC 600 

AATAATCTCA TCAAACTTAT CAAACGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 660 

AAAAAACGGA TTTTTATCGC TCTGAACATC AAAAAAGAAA GGACGAAATT TGTCCTTTCT 720 

CAACCTTAGC TTTTCTTCAA CCCACTACAG TTGACAAAGA GCCTATTTTC GCTGATTCTC 780 

CACTACATTT GACTGGATTC TAATTTTTTA GAGAAATACA AAAGAGCTAG CTTTAGCTAG 840 

CTCTTTTCCT ATGCGGAGAG AGGGACTTGA ACCCTCACGA CCTAAAGCGG TCACAGCATC 900 

CTTAGTCCTG CGCGTCTGCC AATTCCGCCA TCCCCGCGTC GATTACTTTA CTAGTATATC 960 

AACTTTTGGG ATGCTTGTCA ACACTTTTTT TCAAATTTTT TCATTTTCAC CAACCAGGTT 1020 

ACTCAAAAAG TTCATTTAGA TTTTCATCTA CTAACTTAGC TCCGAGTGTA TTTTTGAAAT 1080 

GACCTAGGGC AAATTGATGA TTTTCAGGCC AGATGGAAGC AACAGCTGGT TTAACAATCT 1140 

CGATGTCATA TCCTAGATTA TAGGCATCTA TAGCTGTATG TAGGACACAG ATATCCGTCA 1200 

AGACACCTGT TAAGATAACG GTAGACACTC TACGCTCTCT CAAACGAATA TCTAGGTCAG 1260 

TCCCTGAAAA AGCTGAGTAA TGGCGTTTAT CCATCCAAAA GACACGACTG TCTGAACCAT 1320 

GCTCTTGATA AAAGATCCCC AAATCTCCAT ATAAATTCCG TCCACTCGTC CCAATCAGAT 1380

TATGAGGAGG AAATAACTTA CTTTCCGGAT GGAAACAATC GTTTTCTTCA TGAGCATCAA 1440

TAGTAAAGAA GATATAATCT CCTCCTTCAA AAGCTAATCG AGTTACCTTG CTGATGGCAT 1500 

CCGAAATCGC CTGAGCTGGA GCACCTGCTG TTAGTTTCCC ACTATCAGCA ACAAAATCTT 1560 

CTGTATAATC AATCGAAATT AAAGCCTTTG TCATTAGTAA TCTCTTTTCT TCACTTCTTC 1620 

AAAAATATCT GAAATCAAGA CCTTAAGATA GGTTCCCTTC ATTCCAAGTG AGCGACTTTC 1680 

AATAATCCCC GCAGACTCAA GTTTACGAAG AGCATTGACA ATCACAGAGC GAGTGATTCC 1740 

GATACGATCT GCAATCACTG ACGCAGTCAA CTTCCCTTCA TTTCCATTTA ATTCCCCTAA 1800 

AATTGCTGAA ACAGCACGGA GTTCGGAGTA AGAAAGGGTA TTGACCGCCA TGGTGACAGC 1860 

AGTACGACGA CGAATATTTT TCTCATCTTC TTCACGTTGG AAGTTAAGAA GCTGAATCCC 1920 

AACAACGGTA CTGGCAATCT CAACAAGAAC CAAGTCCTCA TCTTCGAATT TTTTATCATT 1980 

ACGCCAAATA ATCAAAGAAC CAAGGCGAAT CCCCGATACA TGAATCGGTG CAATAGTCGT 2040

CAAGCCATCT GGAAAATCAT CTCTACTCTC AATAGGGAAA ATACTCATAT CATGCTCAAC 2100 

AGGCAAGTTT GCTTCTGTTT CGTAAATCAT ATTAGCCCCT TGAACCTAGT CATCTGGGAA 2160 

AATCTTAGTT TGGAAGAATT GCTTACGCGA TCTGTATTTG TTTTATAACG CATAAAATAG 2220

CCAAGCAGAC GTCCCTTACT ATTGATAATG CAGGCATTGC AATGAATAAT ATCCGCTAAC 2280 

TGACGCGTAA TAGCGTTGTA AGGGAGCTCA TCTCG 2315 
(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:

CGATTTCTTC CATTTCTTCA AATAAGAATA CTTCATCTGA CATATGTGTT ACCTTCTTCA 60 

TCAAAAATTA TTTTGTAATC GATTACATTG CAGATCGTAA CATAAAGAAA AACAGATGTC 120 

AAATATTAAA CGTAAAAACA TGGTCACTAA AGAACTATAA GAGAAAAGGT AAACCTAGCG 180 

ACGCGATGAA CGCTGGGTCG TTTGGTTTCG ATTGCTCTCT TCCTCTTGTT TTTTCTGTTC 240 

TTCTTCTTGT TTTTTCTCAG CTTCCTTGGC CTCTTGTTTG GCTTTTTCCT CAGCTTCCAT 300 

AATTAATTTA TCCGCCACAG TGTAGCTGTA GATTCCAGCT TCCATGTCGA CCACACTCGG 360 

TTCTGACAAT TGAGGCTTAA TCTTACTGTA ATATGGCAGT TTCTTACTCA TTTCAGATAG 420

AGGAACCAAG ACTTCGTCCG AATCATTCAT GGTCAATCGA ATTAAATCGG ATGTCACCTT 480

GCTTGGGCCT AATTCCACCT TTTGGATAGC CGCCTTGAGT TCTGGGCTAA TTTGAGCAAG 540 

TTCTGAGACA AAAACTTTGA TTTGTTCACT ATCATTAAAG AGAACTGATA AATAAGTTTC 600 

TGGTAAACTG TTCAGACTCA CAGAACTAGT CTCAAGCTGA CCACTGGAAA GAATAGGATA 660 

ATGATTTTCA CCAGAAATAT AGTAGGCCAC AATATCATAT TCCTTGACCT TAATAGTGAA 720 

CTTAGTTGGA AATTGATAGA CAAGTTGAGC TGATTCAACC CAATAGTTAG ACTTAATCTG 780 

CTTTTCATAT TTTGCCTTGT CTAGCAGAAG GTTAATCGTA TAATCCGAAT CCTGAATGCC 840 

TGAAGCCTGT CGAATATCAT CAGCTGTAGT TTGCACCGTT CCCTCAACAC GAATATCTTT 900 

CATGGTCGCA TAAGGACTGA GCAAGTAGGC AGAGACAAAC AATAGAAGCA GACTTGGAAA 960 

TAAAATCGTG AAGGCTCGCA AGATATGGAT ACCAGGAATC TTTGCTTTGG CTGGTTTTTC 1020 

CTTTGTAGCC TTTTTAGCAA GCTTTTTATC CTGTTCCTCC TTCTCTTTAG ACTCTGGTTC 1080 

TTCTTTCTCT TCTTTCTCTT TGTCAGCCTC TGAGGATGCT ACTTTTTCTT CAGACTCTTC 1140 

CTTAGCTGAT TCTGAATCTT CCTGGTCTGT TTCACTCTCC TGGTCCTGTT TATCCTCTGA 1200 

CTTCTCAGAT TCTTCTCCCA TTCGAGCTTG TCTTTCCTTT TCCTTCTCCT CAGCTAGAGC 1260 

CGCCTCTTCT TCAGCCTTCT TTTTTAGATA TTCTTGGTTT CGTTTCTGCC ATTCTGATAA 1320 

CTCTTTCAAT TCTTCGAGGG TTTCTTTGTC CTCATTTTTC TTATCTTTTG ACATTTACTT 1380 

TCCTTATGAT AAATCTTTTT TCAACAATTG ATAAAAATCT GCTAGAGATT TCAATTCCTT 1440 

AGAAGCTTTC ATCTTAGCTT GGTAATCTTC CTTGTGACTT AGTAAGTGAG AAAGCTTCTC 1500 

TTCCAAACTA TCCAAGGTCA AATCGCTTTC TTGAAGGTCT TCTGCATAGC CTTTCTTAAC 1560 

AAAGTAAGCT GCATTTTCAA TCTGGTCACC ACGACTAGCT TCACGACCAA GCGGCACAAT 1620 

GACATGCAAT TTTGCTATCG CCAAGAGCTC AAAAATCGTA TTGGCACCAC CTCGTGTCAC 1680 

AACAATATCA GCCAATTCCA TCAAGGGTTG ATAGAGATCG GTCACATAGT CAACACGAAA 1740 

AAGATTTTGC CTCAACTCAT TCAGACTAGA ATCTCCAGTT AGATTGATAA TATTGTAGCG 1800 

CTCTGTTAGT TCTTTCTTAT GGTCTGTCAC CAATTGGTTA AAGACACGAG CGCCTGCAGA 1860 

ACCGCCAACA AACAATACAG TTGGCAATTT GGGATTAAAG TGGGTTTGAA TATCCACCAA 1920 

TTCATCTGGT TCTGGAGTGT TTTTGTCCGA AACCTTGGTC ACCGCTCCCA CATGCTCAAC 1980 

CTTAGCCAAA CTCGAAGCTT CTTCAAAGGT TGAATACATC TTAGTCGCAA ATTTATAGGC 2040 

GATTTTATTG GCCAAGCCCA TAGACAGGTC AGATTCGTGA ATAAAGACAG GCACTCCTGA 2100 

CACACGCGCA GCGATAACAG GCGCTACTGA GACAAAGCCC CCCTTTGAAA AAAGGGTCTG 2160 

TGGACGCAGT CGCAACATGA TAAAGAGCGA TTGGACAATT CCCCAACCAA CTTTGAAGAC 2220 
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GTCCAGCATA 


. TTTTGCCAAG 


AGAAATAGCG 


: ACGCAATTTT 


' CCACTCGCAA 


TAGAATGGAA 


2280 


GGTGACATCC 


AAACCTGACT 


TAAGCATTTC 


TTGGTGTTCG 


; ATACCACACT 


TGTCCCCGAT 


2340 


ATACTGGACT 


TCCCAACCAT 


CTTCGATGAA 


CTTGGGCATT 


AACAAAAGAT 


TGAGGGTAAC 


2400 


GTGTCCAACC 


GTCCCCCCAC 


CTGTAAAGAC 


AATTTTTTTC 


ATATTATTCT 


TTTAACTCCG 


2460 


CTACTGTGTC 


GATAAAGAGG 


TCGCCACGTA 


CTTCAAAGTT 


AGCATACATA 


TCCCAGCTAG 


2520 


CATTGGCAGG 


ACTAAGAAGA 


ACCACATCTC 


CTTGAGTCGC 


AAGCTCATAG 


GCCTTGCGGG 


2580 


TCGCATCTGC 


AATATCTGTC 


GCCTCCACAT 


AAGCGACACC 


AGCCTTGTCT 


GCTGCCCGTT 


2640 


TGACACGTTC 


TGCAGATTGA 


CCCAGGATGA 


CCATCTTCTT 


GAGTCCAGTA 


ATGTCTGCCA 


2700 


CCAATTCCTC 


AAACTCATTG 


CCACGGTCCA 


AACCACCTGC 


AATCAAGACG 


ACCTTGCTGT 


2760 


TGTCAAATCC 


TGACAAGGCT 


TTTTGAGTAG 


CCAAGATATT 


AGTTGATTTA 


CTGTCGTTAT 


2820 


AGAATTTAAC 


ACCCTTGATG 


TCATCCACAA 


ACTGGAGACG 


GTGTTTGACA 


CCACCGAAGG 


2880 


CTGAAAGAGT 


TTCCTTGATG 


GTTTGATTGT 


CCACATCACG 


AAGCTTGGCT 


ACAGCAATAG 


2940 


TCGCAAGGGC 


ATTTTCCACA 


TTGTGGCTAC 


CTGGAACACC 


GATTTCATTC 


GCTGCCATGA 


3000 


CTACTTCACC 


ACGGAAGTAG 


AGTTGACCAT 


CTTCCAGATA 


AGCTCCATCA 


ACCTTTTCAA 


3060 


GTGTTGAAAA 


TGGTACAACA 


GTGGCTTCTG 


TCTTGGAAGT 


CAAGTCTTTT 


GCCAAGTCTT 


3120 


GATTAAAGTT 


CAAGACAAGG 


AAATCAGCTG 


CTGTCATCTT 


GTTCTGGATA 


TTCCACTTGG 


3160 


CTGCTACATA 


TTCCGAAAAT 


GACCCATGGT 


AGTCGATATG 


AGTTGGCATG 


AGGTTGGTAA 


3240 


TAACCGCAAT 


CTCTGGATGG 


AATTCTTGAA 


CACCCATGAG 


TTGGAAAGAA 


GAAAGTTCCA 


3300 


TAACAAGCGT 


GTCCTTATCT 


GATGCTATTT 


GAGCAACCTG 


ACTAGCTGGA 


TAGCCGATAT 


3360 


TCCCTGATAA 


AAGACCATGT 


TGGCCAGCAG 


CAGTCAAAAC 


TTCCCCAATC 


ATAGTCGTTG 


3420 


TGGTTGTCTT 


ACCGTTCGAT 


CCTGTGATAC 


CAATAATCGG 


TGCTTCTGAA 


ATCAAATAAG 


3480 


CCAATTCCAC 


CTCAGTCAAG 


ACTGGAATTC 


CCTTGGCCAA 


AGCCTTTTCA 


ATCATGGGAT 


3540 


TGTTGTAGGG 


GATACCTGGA 


TTTTTCACCA 


TAAGGGCAAA 


CTCTTCATCC 


AAGAGTTCCA 


3600 


AAGGATGGCC 


ACCTGTAATG 


ACCTTGATCC 


CTTCTTCCAG 


CAAACTTTGG 


GCAGCTGGAT 


3660 


TGTCCTCGAA 


AGGTTTCCCA 


TCATTTACTG 


TCACAATGGC 


ACCTAGCTTG 


TCCAACAAAC 


3720 


GAGCTGCAGA 


TTCACCAGAC 


TTGGCCAAAC 


CTAAAACAAG 


gactttctta 


TTTTTAAATT 


3780 


GATCTATTAC 


TTTCATGTCT 


CGAACTCCAT 


TTCTACTCCT 


ACTATTTTAC 


CATTTTTATG 


3840 


GAAATAAAAA 


AGOCACAAAG 


TGTGTTTGTG 


ACTCTTTCTT 


CTAACTGAAT 


CTTACCATAT 


3900 


CATCTATGTG 


ATAAATCGGT 


AACTCGAATG 


ACCTGATCCA 


CTTGCTCCCA AATCAGAGGA 


3960 
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TTATGGGTCG 


i CAATAATAAT 


■ GGTCCGATTC 


GGATTTTITA 


. AAGATTCTAG 


GATGGAAAGT 


4020 


AATTCCTCAG 


> AGTTTTTGGG 


GTCTAAGGAA 


GCGGTTCGTT 


CATCTGCGAG 


GATCAAAGG? 


4080 


GGATCCTTTA 


. AAATTATCTT 


CGCTAGTGCA 


ACACGTTGTG 


CTTCTCCTCC 


TGATAACTCA 


4140 


AATATAGGTT 


GCTTCAAATC 


CAAATAAGAG 


AGGTTTACAC 


GGTTTAGAGC 


TTGTTTCATC 


4200 


AAAGAGATTT 


TCTCTTTTTC 


CTTCAACTTT 


TTACCAACTA 


AACCCAGATT 


GAGATTCTCT 


4260 


TTGACGGTTT 


GGCTTTCAAT 


TAAGCCAAAA 


TCTTGAAATA 


AGTATCCTAA 


GTAATCTCTA 


4320 


AAGAAAACAG 


AAGGCTTGAT 


GTCCTTAAGA 


GAAGTGCCAT 


CATAGATGAT 


TTGCCCTTTG 


4380 


TCATATGGCT CCAATCGTCC AATCATATTC AAGAGTGTTG TCTTACCACA GCCACTTGTA 4440

CCGATTAAGG CATAAATTTT CCCACCTTCA AAATGAAGAT TCATATCTGA AAATAGCTGA 4500

CGGCTTCCAA ATTTTTTAGA TATATTCTTT AGTTCAATCA TCCTATTTTC CTTTCATAAT 4560

TGTCATAGAA ACACGAGATT CTTTCTGCGC TTGACGCTAA AGCGTCAAAA CTGCACTAGC 4620

TAGAAAGACC AATAAAGTGA GCAAGCCAAT CACCAAGTCT CGACTGCTTA AAATAAAGAG 4680

ACTAGCACCA AATACAAAAC TAGCAAATTG GCTAACCATA TACTGAGCAT GTGTTTCAAA 4740

AAATCGTAAA CCTGAAATTC GTTTAATCAA GATATCTCGG CGGAATTGCT CGAAATATAG 4800

AAGATTGACA GAATAAAAGA GTAACAAGGA ACTGGCTATT CCAACAATAG CTCCTAAGAT 4860

TAAAGTTGCT GTTTCAGTTT GAACTTCATT ATAACGAGTT AGATAAACAC TTCTTCCTTC 4920

TTTAAGATAG GATACTTGCT CATAAATTCC AGCTTTCTTC AAGAGTTCTA GCCCACTCTC 4980

ATATCCTTTG ATAAAGAGTT GTTTTCCAGC ATTGATAGAC CAACTAGATA AGGATATAAA 5040

ACTATCACCT GTAGAAGTCG GCGTGAATAC CACTAAAATC GGATCAGTCA AATACTGAGT 5100

AGATACGGGA TTCTCACCGT TATTATAAAC AAACCGCTTT TCTCCCATTG AAAGATAACT 5160

AACGTGCGCT TTCATCTCAT AATCCAAAGG AGCACTTCCC TCCTCACCAG ATTTTCCATA 5220

ATAACTCAAT CTTTCTTCAA AAACTTTCTT AAGTTCTGCT TCTCGAGAGC GCAAATGTTC 5280

TGGGAGCAAG AGGATAAACT CACCTTTTTG GACATGGGCT t\J\{. 1 L \* l 0 1 1 5340

ATCTACCACG ACCTTTTCCT TGTCCAAATA ACTGGGACTA ACATAGAGCG TATTAGCATC 5400

TGAACTATAG GTATCCAGTG TCTCTCCCTG TTCATTTTTT CCTTGTGGAT TGGCAAAATG 5460

GAGCAGATTA TCCTTTACAT AAAGAGCTTG TTCTTCTTCG ATTGCTTCCT TGGCAAAGGC 5520

ATACCACTTG CTCTGATTTT CTGTATCTTT TCCTCTATCA CCTAAGCCAA AGGAAATCTG 5580

GTAATAGTCT GCTCTGTCCT GCCATGCTTG TTTTGAAATT TCAAGTTCTT TCAATCGTTG 5640

GTAAGACGTC AAACCTGTCT TAACAGCGTA GCCTACTGTA AAAACAGCTA CTAACTGACA 5700

CAATAGGGTT AAAGCCATCA AGCGTTTAAG GGGTAATCTT CCCTTAATAA CGGGAACTAA 5760



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TGCTTTGTAA CTCAAACTCA TTACGTAAAG GAGCATTAGT AAAATTGAAA TCGCCAATAA 5820 

AAACAACAGA TAGAAACTAA TCCCAAAACC ATAGGTCGCT AACAAGATAG GATAAAACAA 5880 

ACCTTGACTA AAAAGAACGA CTCCCCCACC TAGGAAGGAA AGGAGGGCTG ATAGAAGGAG 5940 

CCATTTGATA TCAGTAGATA AAGAATGCCC CATGATGGAT AAGAGAGTCT GACCAGAAAA 6000

GAGTTTTATA CCTGCTGCTC TCATTTCCTT AATCCGAGTG ATAATCACTA AAGCAAAGAA 6060 

AGATAAGCCA AATATTGCTA AACTAATTAA AATAAGGGGA TTTAGTAATA TTCGAAAAGC 6120 

AAGAAAATAG GGCGGTATCT TTCGGTCAGC ACTTGCTTTA TAACCCAAAT CTCCTAATTT 6180 

ATCGGCAAGC TTTTCTTTCG TCAAGGAGCC TGACAAAAGG AGATAACTAT TTAGCGGAnT 6240 

AtACGTTCAC GACTTTCTTG GCTAGCTTCT TGGAATTCTT TTGGTAAAGT TCCCTGACCA 6300 

TAAGTTGCAT AAGTAAAGTG AGTCGTCCCA TCCTTACTCG GCTCTACAAT TCTTCTAGCT 6360 

ATTAAACTCT GTTCTGAGTT TGCAAAATTC TCCAATTCCT GTTCAAATAC CTCACGCGTC 6420 

GGTTCCTGAG TATCTTTTTT GACACGAAGT AAAGAAACGG AATCATAGCT TGCATATAAA 6480 

TATTGTGGCG CACGTAAGAC AATAATCCAA GCAAGGAAGA AGCTGAGAAA AAAAGTTGAT 6540 

AATAATATGA ATAGTTTCTT CATAGTAGAC TCCTTGTAAA CAAAATTCCC CCTGTAATTT 6600 

CTTACAAGGG GAACGATTTA AATCAATGAA CGATTAGTCA TAATCACAGT AAAATCCTAC 6660 

TTGTTCTCCC CATTTAGTCC AAATCCATGC AGG 6693 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:

CCGGTCTATG TACCCACTAC TTTGGGACAA TATGGGGATC AGCTACCCAA AACTAATCGA 60 

GCGTTTGGTT GACCTTGCCA AGGAAAGTTT TGACAAGCGC GACGATTTGA TATAAAATGA 120 

AAGAGAGGGT AGAAGCCAGA ACCATCACTG CACGGTGACT AGAGTTCTCG GACTTCAGCC 180 

CTTTTTAAAG GAGTAGAAAT GAAATTAACA ATCCATGAAA TTGCCCAAGT TGTTGGAGCC 240 

AAAAATGATA TCAGTATCTT TGAGGACACC CAGTTAGAAA AAGCTGAGTT TGATAGTCGT 300 

TTGATTGGAA CTGGAGATTT ATTTGTGCCA CTTAAAGGTG CGCGTGATGG CCATGACTTT 360 

ATTGAAACAG CCTTTGAAAA TGGTGCAGCA GTAACCTTGT CTGAGAAAGA GGTCTCAAAT 420

CATCCTTACA TTCTAGTAGA TGATGTTTTG ACAGCCTTTC AATCCTTAGC ATCCTACTAT 480

CTTGAAAAAA CGACTGTTGA TGTCTTTGCT GTTACAGGTT CAAATGGCAA GACAACGACT 540

AAGGATATGT TGGCGCATTT ACTGTCAACA AGATACAAGA CCTACAAAAC ACAAGGCAAT 600

TACAATAATG AGATTGGCCT TCCTTACACA GTTCTTCATA TGCCTGAAGG AACAGAAAAG 660

TTGGTTTTGG AGATGGGACA GGATCACTTG GGCGATATTC ATCTCTTGTC TGAATTCGCT 720

CGTCCAAAAA CAGCCATCGT GACCTTGGTT GGAGAAGCCC ATTTGGCCTT TTTCAAAGAC 780

CGTTCAGAGA TTGCTAAGGG AAAAATGCAA ATTGCAGACG GAATGGCTTC AGGTTCCTTG 840

CTTTTAGCGC CGGCTGACCC TATCGTAGAG GACTATTTGC CAACTGATAA AAAGGTGGTT 900

CGTTTTGGGC AAGGGGCAGA GCTGGAAATT ACTGACTTGG TTGAGCGCAA AGATAGTCTG 960

ACCTTCAAGG CCAATTTCTT AGAGCAAGCC CTTGATTTGC CAGTAACTGG CAAGTACAAT 1020

GCGACAAATG CTATGATTGC ATCCTATGTT GCCTTCCAAG AAGGAGTTTC AGAGGAGCAA 1080

ATTCGTTTGG CCTTCCAAGA TCTTGAATTG ACGCGTAACC GTACCGAGTG GAAGAAAGCA 1140

GCCAATGGAG CAGATATCCT GTCAGATGTT TACAATGCCA ATCCAACTGC TATGAAACTG 1200

ATTTTAGAGA CTTTCTCTGC CATTCCAGCC AATGAAGGTG GCAAGAAAAT TGCAGTGTTG 1260

CCGGATATGA AGGACCTTGG TGACCAGTCT GTTCAACTTC ATAATCAGAT GATTTTGAGC 1320

CTTTCTCCAG ATGTGCTTGA TACCGTGATT TTCTATGGAG AAAATATTGC TGAATTAGCC 1380

CAATTGGCCA GTCAAATGTT CCCAATCGGC CACGTTTACT ACTTCAAGAA AACAGAAGAC 1440

CAGGATCAAT TTGAAGACCT AGTCAAGCAG GTCAAGGAAA GCCTTGGAGC CCATGACCAA 1500

ATCCTGCTCA AAGGCTCTAA CTCTATGAAT CTAGCCAAGT TGGTAGAAAG TTTAGAAAAT 1560

GAAGACAAGT GATTTTGTCA AGTATTTGCA AAGAATGATT GCCATTACAG ATACTGGCTT 1620

AACCTTTACA AAAGATCCGT TTGACCGTGA GCGCTACGAA GACTTGCGAA GTCTGTTATC 1680

TGAAATGTTG AATCAAGCAT CAGACCTTGA TTCCGAAGAA GTGGCAGAAG TCTTGAAGCC 1740

AACTTCTGCT TATGCGACTC CGTTAATGGA CGTCCGTGCT TGGATTGTTG AGGATGAGAA 1800

GATTTGTCTG GTTAGGGGAC AAGGAGAGGA TAGTTGGGCT TTGCCGG 1847


(2) INFORMATION FOR SEQ ID NO: 197: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1062 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:
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CAAGCGAAAA CATTTTTTAT TCCAAATAAA CAGAGCATTT TAGGAGAACA AGAGATTTTG 60 

AATGCCAAGT CGATCTTGGC CTTGCTAGAC GGTTTGGAGT CACATAGCTA TGATGTAGTC 120 

TATCTCCGTC AGCCTCTTAA TCGTCTCGAA TATATCGAGT GTGCGATAGT GGGGCAATCA 180 

CAATTTCTCT TTAAGGTCAG TTATGCTGAT GGTCAAAAGG CTTACCCTGT CGATCTTCCT 240 

GACCTACTAA CAAAGACAGA CTGGCAGATT ATCAAGTCAT TTTTAGATGC TTTGCTTGCT 300 

TATACAGGGA CTGATATTGA AGGGCTAGAT GGTTTTGATT TTGAAGCTTA TTTCCAAGCA 360 

AGTATTCAAG CCTATCTAGC AGACCCTGTA GCTCGTTTTA CGATTTGCCA AGGAATTTTT 420 

AATCCTATTT TCTTTAGTCG TGAGAACTTG AAAAGCTTTT TAGAGGCAGA TGGCTTGGCT 480 

CAGTTTGAAG CGCGTGTGCG TGCGGTTCAA GAGACAGATG CCTACTTTGC GAGACTTTCC 540 

TTCTATCAGG ATGGAGAAGG AAAAGTGCAT GGCGTTTACC ATCTAGCTCA AGGAGTCAAG 600 

ACAGTTTTAC CGAGAGAACC GTTTGTTCCT GCAGCCTATA TTGAGCAATT GGTGGATAAG 660 

CAAGTCCAGT GGGAGATTGA CTTGGTTCAA ATCACAGGAG ATGGCTCTAA ACCAGAAGAC 720 

TATGAAGCCA TTGCTCGCTT GGACTATGCA AAATTCTTAG AGGTATTACC CCCATCTTTT 780 

TACCACCAAC TAGACGCCAA TCAAATAGAA GTGCAACCCA TATTAGACAA AGATTTTAAA 840 

ACATTAGCAC AAGAAAAGTA AAGCAGAAGC AGGTCAATCG ACTTGCTTTT TTGACATAGA 900 

AAAAATCCTG CCAAGaTGAC AGGATTGCTA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 960 

GCTAGCCGCA GCTGTACTTG AGTACGGTAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT 1020 

TTGAAGAGTA TGAAGTTTAA AGAAAAGCCA AGATACGAAG AT 1062
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6846 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

TATCTACAAC CTCAAAAACA TGTTTTGawG gCTCGTCAGT cTATCTACAA CCTCAAAAAC 60 

ATGTTTTgAa kGCtcGTCAG tTCTATCTAC AACCTCAAAA ACATGTTTTG AcaGCcTcGT 120 

CAGTTCTATC TACAACCTCA AAAACATGTT TTGAGCTGAC TTCGTTAGTT TCATCTACAA 180 

CCTCAAAAAC ATGTTTTGAG CTGACTTCGT TAGTTTCATC TACAACCTCA AAAACATGTT 240 

TTGangnCnT CGTCAGTTCT ATCTGCAACC TCAAAGCAGT GCTTTgagcG CTTCGTCAGT 300

TCTATCTACA ACCTCAAAAC AGTGTGTTGC GCACCCTTTA ATCAGCCGCC TAGTCCGCTC 360

TATGGTATTC ATTAAGTCAA CATCTCTTGT TTAAGAGCAC CAAATCAGGA AATCTTCTCG 420

ATTCCCTGAT TTTTTCTATT TACGTTTTCG TGTTGAGCTA CGTTCTGTCA AACCATGAGG 480

TAAGAGAACT TCACGTTCTT CCAACTCTTC CTTATGCATA ATCTTGGTCA ACATACGCAT 540

ACTAATGGCA CCAAGGTCAT AAAGAGGTTG GGCAATCGTT GTCKAGTTTG GACGGGTAAA 600

GCGTGAGATT TGTGAATCAT CACTAGTAAT AATTTCAAAA TCTTCTGGCA CAGAAACACC 660

CTTATCAGCC AAACCGTTCA AGACTCCTGC TGCCAACTCA TCACCTGTCA CAACTGCTGC 720

AGTTGCATTT GATGAAATCA AACGCTCTGC TAAGGCGTAA CCATCATCAT AGCTATATTT 780

AGATTCAAAT ACCAAACCCT CACTATAAGT GATTCCTGCT TTTTTCAAGG TTTCCTTCTA 840

GCCAACTAAA CGAACCTTAC CATTGATGTC ATCCACTAGC GGACCGCTAA CGAAAGCAAT 900

ACGCTCATTT TCTTTAGCAA GGTAACTCAC TGCATCAATT GTTGCTTGCT TATAGTCAAT 960

ATTGACACTT GGCAACTGGT GCTCAACATC GACAGTTCCT GCGAGAACAA TCGGAGTACG 1020

TGAACGCGAA AATTCTGAGC GAATTTTATC TGTCAAGTGA TACCCCATAT AGATAATGCC 1080

ATCTACCTGC TTTGAAAAGA GGGTATTGAC AACAGAAACT TCTTTCTCGT TATCTTCATC 1140

GCTATTAGCT AGGACAATAT TGTACTTGTA CATTTCTGCA ATATCATCAA TCCCCTTAGC 1200

CAAACTCGAA AAATAACCAT TGGTAATATT TGGAATCACG ACACCGACAG TQGTTGTCTT 1260

TTTACTTGCA AGACCACGCG CAACTGCATT TGGACGATAA TCCAAACGAT CAATTACCTC 1320

TAGCACTTTT TTACGGGTAT TCTCTTTTAC ATTTTTATTG CCATTGACCA CACGGCTGAC 1380

CGTCGCCATG GAAACACCTG CTTCACGAGC GACATCATAA ATGGTTACTG TATCATCTGC 1440

ATTCATTCCT TTTCCTGTCC TTTCTATCTC ACACATTCTT TTACAAGTAG AGGTACTGAT 1500

TGAAGCTCTA TATCTACTTA CAAAAGTGAA GATGTGAAAA TTTCGTTTTC ATATTTCTAC 1560

TTATTCCATT CTATCACTAA TTGTAAACAC TTTCAAGTGT TTTTTGAAGA TTGATTGAAA 1620

AAATTTCATA GAAAACCTAG GTTTAGCTCC TTGCTACCAC CTTAGACTAA ACAAAAAGGA 1680

GGAAACTAAG CCCTCCTAAA GTTATAGTAA AATGAAATAA GAACAGGATA AATCGATCAG 1740

GACAGTCAAA TCGATTTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA 1800

TCTACTATAG GTATTGTTCC ATTCACTACC GTCAATTTTA GCACATAGTC TTCATGAAAA 1860

TATTATATCA TCATAACCAA CCAGATTCTT TCGCGATATT AGCTGCCTCT GTTCGATTAC 1920

CTGCATCTAG TTTCGAAAGA ATATTGGTGA CATAGTTTCG GACTGTTCCG TTGGATAGAT 1980

AAAGTTTGTC TGCAATTTCT TGGTTAGAGA AGCCCTGAGC AATTCCCTTT AAAACTGCGA 2040

TTTCTTGCTC CGTTAATGGA TTGGGATGCA TCATCACCAC TTCCATCAAT TCAGGCGAAT 2100

ACTCCTTGCG TCCTTCGAGG ACGGTGTGCA AGGITI'GCAT GAGGTCTGCA ATGTTTCTTT 2160

CTTTTAATAC ATAAGCATCT ACTCCAGCCT TGACCGCACG TTCAAAATAC CCAGGACGCT 2220

TGAAGGTCGT CACCACAACC ACCTTTGTTT CAAGCTTTTC TGCTCGTATC CACTCCAAGA 2280

CTTCAAGACC TGTCTTAACA GGCATTTCTA CGTCAAGGAT GGCGATATCT ACAGACTCCT 2340

TTTCTAATAG TTGGATTGCT TCTTGCCCAT TCTTGGCTTG AAACACAGAC TCTACATCCG 2400

GTTGAAGCAT GAGCAACTGG CACATGGCAT CTCGCAACAT ACTTTGATCT TCTGCGACTA 2460

ATACTTTCAT CTACTTTCTC TCCTTATAAA CTAGTCGAAC CTGCACTTCA GTTGGATGTT 2520


TCTGACTGAT 


TACACTTACT 


TCTCCTRAAA 


AITOtAAAAAr 1 


ACGATTTCGG 


ACTCTATGGA 


2580


GCTCATCCCC 


GCTTATAGAG 


fir*AAAfIPf*Af* 




TCTCACTGTT 


AGAATGAGTT 


2640 


CTTTCTCTGT 


CCGTTCTAAT 






TTTAGCATGT 


TTGATGATAT 


2700 


TGGTCACTAA 


TTCAAGCAAA 






CAATTCCTGA 


GTTAAGCTAG 


2760 


ACTTGTCCAA 


GTGATTCTCA 


ArTTY! 21 A CCH* 
f\\- 1 1 \ j#U\^>W X 




AATTTCTAAC 


ATCTTTTTCA 


2820 


CAGTCTCTAG 


TTCGGATGTC 


nfUw 1 iv_ 1 Al» 


AV.I lAAuAl 1 


TTCCACAATG 


GTTCGCACTT 


2880 


CATTCATGGA tCCTTGCTGA TCTGGTGAAT TTCTTTTAAT TCCTTTTCCA CCTGTGGATA 2940

AGCCTCCATC TGAAATAACT GCAAGGCTAA ATCTGTCTTG ACACTCAGCA TAGCAAAGGT 3000

ATGTCCCAGA CTATCATGCA AATCCTGACC GATACGACTA CGTTCATTTT CAGCAAGCAA 3060

TAGATTTATC TGAGCATTTT GCTTGACCTG AGCTTCTTTC AAATCCTCGA CAATACGAAT 3120

CCGAACCAAT CCAAAAGTCA TTAAATCGAC AAAAGTAAGA ATTACAAGTA GATAGAATAG 3180

AAACTCAACT TCGATTCTCT GAAAAATCAA CAGTTGCCCC ACAACAAGGA CTTGAGCAAG 3240

AAGAAAAGTC CAGACATGTA AAGACTTTAA ACTACGTACG CTGAAATGAT AACTTAAGAG 3300

ATTGGATAGG AAAAAGAAAA ACCAGATATA ATTAACAGCA ACAAAGGCAG TATTCCCAAC 3360

TACATAAGTC AGCATGAGGC CCCAATATAG CCAAGATAGG CGCTGGCTCT TAGTTGTTAA 3420

AACACCCAAA TATGCCACTA CAAATAGAAT ATCAATCAAT AAATGCCAGG CAGAAAGCCA 3480

CCCAGTCACT ACAGACAGGA TGGGGAAAAT CATAAAAATT AAACTGATCC AAAACATATA 3540

ATGTATTCTT TTCAGTCTTT CAAGCATTAA GCATTCTCCT TATGACCTTG AAGGTAAATG 3600

GTCAAACCAA ACAAAACTAC TGAAAAAACA AGTAAATAAA CTGTGGCTGA TAGATTGATG 3660

CCACCCTCAT TTAAGAAGGT CTTGAGCAAC TCCATCAACT GATAGGTCGG GAGACACTTA 3720

CCTACTACTT GCATCCAGTC TGGAAATAAA GAGATAGGCA TCCAGAGTCC ACCTAAAACA 3780

GCCAACCCTA GATAAAGAAG ATTGCCCACG ACAGACATCA ACTGACTAGT TGGTAAGAGA 3840

GTCAAGGTCA AACCAAGCGC TACGAAGGCA ATACTTCCTA CTATCAGCAA AAGTGCAGCC 3900

CCAATCCAAT TTCCAAGAGA CATCTCCACA CCTCTTACAA AATGCCCAAC TGAGAAAACC 3960

ACCAAGATTG AAACCAAATA ATCAACCAGC ATACTTGTTA TCTTTGATAG ATAATATTCT 4020

ACCATATTTA CAGGGCTATG ACGCAATGTT TTCTGCCAGT TGTTGATCTT GTCGGTATGT 4080

AAAACAACTG GGAATGAGAA GATAGCTGTT GACATCATGG AAAATGCAGT CATGGAGATA 4140

AGATAATCAC GCATAAAATT CGCGAGTTCA CCTGGTGTGT CCTGATAGAT ACCAGAAAAA 4200

AATAAATAGA AAGCCGTCGG CATCCCTACT GACAATAGAT AATAGATCAA TTGTCGTTTG 4260

GTCAATAAAA ATTCTATCTT ACTAAGTGCT AGCCATCGTT TCATCTTAGT TATCTCCCTT 4320

CTGCGTTTCT TCAAAGATTG TATCCAACAA ACTACGATTA TTAACTTCAA TTTCTTGTAT 4380

GCCACATCCT GCTTGAACTA ACAGTTCCCA AAAAGCATCT GCTTCGCGTG TGACTACTTG 4440

TAGAGCATCC TGTTTTTGTG ACCAGTTTTC AACCAAGTTA GACTGCTCAA TGACTTCCTT 4500

GTATGCCAGA GGAAGGATAA AATGCTTTTC AATTCCCTCA CTACGCATAG CTAGAGGCGT 4560

CGTATCACGA ATCAACTCTC CCTTATTTAA AACCAAAATC CGGTCAGCCG TATGCTCTAC 4620

CTCTTCAATA TAATGAGACG AATAGAGAAT CGTGACTCCT TGCGCTTTTA GGTCCCGAAC 4680

GATTTCCCAA AAGCGTTGAC GAGTTGAAGT ATCCATGGCA GCAGTTGGTT CATCTAAAAA 4740

GACAAGCTTT GGTCGCCCAA TCAAGGTCAA GACAAAAGAG AAGAGACGCT TTTGCCCGCC 4800

TGACAATTTT TCTGCGAATT GCTCTTTTTG TTGCTGGTCA AACTGCAATA GTTGATCGAT 4860

TTCCTGATCG CTCAAGGAAT TTGGATAGAT ACGTTGAAAG AAAGCAATCA ACTCTTTGAC 4920

CTTTAATTTC TGAACGATGA CATTTTCTTG AGGCAGATAA CCTCTAATAT AGTCTAACTG 4980

AGAACTCGTC ACTGACAAGC CTTGGATGGA TACTTGACCG CTTGTGACCA GTTTATCTCC 5040

AAGCAGACAG TCCAAGAGTG TGGTCTTCCC AGCACCATTG GGCCCAATCA AGGCGACGCA 5100

TTCACCTTCA GCTACCTCAA AGGAAATACC CTTCAAAATA GCCTTGCCCT TGATGTTTTT 5160

ATTTAGGCTT TCTACCTTAA TCATATTCAT TTTCAACCAC TCCATTCTCA 5220

TAAGGAAAAC GACGAAAATC ATAAATCCAA ACCCCAAAGC ACCACGAATG AATTGGCGAA 5280

gCAAGGTTTG GTCAAACCAA CCTGTAAACA TTTCCACTAA CCATACCAAG AGTGACAGGC 5340

CGATAAAGAA ATAGATGATC CCTCTCTTCA TTCCTCAAGC TCCTTTTTCA CATCTCCGAC 5400

TAATTTCAAA CCTTCTCTAA CAAGCCAAGA CATCATTCCA AAGCCAGCAA AGAGCTCCCA 5460

AGGAAAATGA TAGAAACTCT CATCCAATCC CGAAAACATG AGTTAGGTCA TAACTCCTGC 5520

TACTACTAAA CTCACTGCGA TAATCATTTT ATTTCTCATC TCTTCTTCCT CCATTTCATA 5580

CTACAATTAT AGTCTTTTGA AATCAGAGGA GACAGAAGCT TCTGTCACTA GAAAATATGA 5640
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CAAATGTCAT AAAAAATTCT GTTCAAAACA AGCAAGATAC ACTATACAAT AAAACACAAT 5700 

TAGAAAAATC TAAGGCAACT TCCTCAAAAG AGATATCAAA CCCAATTCAC ACCATAATGT 5760 

AAACTAATAC TTATTTAAAA TCAAAAAGAG TAGAAATTTT TATCAGACAA ACACATATAT 5820 

AGTGTATTGA ATCTATAACA GTAGGCCTTA AATACTAAAA TATTTCTATA AATTAATTTA 5880 

ACTTTCCTGA TAGAGCTGTT CATATCTTAT TTCAATTCTC TAAATTATAC GTTGAACAAA 5940 

ACCCTTCTAT TTCTTTCTTA AAGATTTATA AGAGTTATAA AATCTGTTAA ATTTCAATGT 6000 

GTATACCTAA ACTACGGTAT TTATTGAAAA GACTGGAGAC AAAAACTATA CGCTGCCAAA 6060 

ATGAATTACT GAAAATCAAA AAAGAGAGAA CCAAACTGAT TCCCTCTTAA TGTATATAAT 6120 

ATCTAGTTTT AAAAATACAC ACTCACATAT CTCTGTAATG AATCGGGAAG ACAGGATTCG 6180 

AACCTGCGAC ACCTTGGTCC CAAACCAAGC ACTCTACCAA GCTGAGCTAC TTCCCGAGTT 6240 

AAATAGAAAA ATGCACCCTA GAGGAGTCGA ACCTCTAACC GCCTGATTCG TAGTCAGGTA 6300 

CTCTATCCAG TTGAGCTAAG GGTGCTCCAT ATTATGCCGA GGACCGGAAT CGAACCGGTA 6360 

CGATCGTTAC CAATCGCAGG ATTTTAAGTC CTGTGCGTCT GCCAGTTCCG CCACCCCGGC 6420 

CTCTCTAAGC GAACGACGGG ATTCGAACCC GCGACCCCCA CCTTGGCAAG GTGGTGTTCT 6480 

ACCACTGAAC TACGTTCGCA CTGTTTTCTT CTATCTAAAA ATGCCGGCTA CATGACTTGA 6540 

ACACGCGACC CTCTGATTAC AAATCAGATG CTCTACCAAC TGAGCTAAGC CGGCTCATTT 6600 

GTTATATCTT AATGCGGGTT AAGGGACTTG AACCCCCACG CCGTTAAGCG CCAGATCCTA 6660 

AATCTGGTGC GTCTGCCAAT TCCCCCAAAC CCGCATATAT GACCCGTACT GGGCTCGAAC 6720 

CAGTGACCCA TTGATTAAAA GTCAATTGCT CTACCAACTG AGCTAACGAG TCTAAAATAA 6780 

cTTGCGTTAC CTTAAACGGT CCCGACGGGA ATCGAACCCG CGATCTcGCC GTGACAAGGC 6840 

GACGTG 6846 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GAATTCATTT TAAATAAAGA TACGGGAGAG GTAAGTGAAT TAAAACCTCA TAGGGTAACT 60 

GTGACCATTC AAAATGGAAA AGAAATGAGT TCAACGATAG TGTCGGAAGA AGATTTTATT 120

TTACCTGTTT ATAAGGGTGA ATTAGAAAAA GGATACCAAT TTGATGGTTG GGAAATTTCT 180

GGTTTCGAAG GTAAAAAAGA CGCTGGCTAT GTTATTAATC TATCAAAAGA TACCTTTATA 240

AAACCTGTAT TCAAGAAAAT AGAGCAGAAA AAGGAGGAAG AAAATAAACC TACTTTTGAT 300

GTATCGAAAA AGAAAGATAA CCCACAAGTA AACCATAGTC AATTAAATGA AAGTCACAGA 360

AAAGAGGATT TACAAAGAGA AGAGCATTCA CAAAAATCTG ATTCAACTAA GGATGTTACA 420

GCTACAGTTC TTGATAAAAA CAATATCAGT AGTAAATCAA CTACTAACAA TCCTAATAAG 480

TTGCCAAAAA CTGGAACAGC AAGCGGAGCC CAGACACTAT TAGCTGCCGG AATAATGTTT 540

ATAGTAGGAA TTTTTCTTGG ATTGAAGAAA AAAAATCAAG ATTAAGATAA AAGCTATAGA 600

AAAAAATGGT TTATGTACTG AGATTAGATA GTGAGGTGAT GACATAGTTT TGTGAAAATA 660

GCCATTTATA ACTCAATTAT TTAGTTTACT TTACTTTACT AGTGATACTA TTTGGAGTTA 720

TTAATGGACT TAGTTTATAT AACTAATGAA TTGATTGAAA GGGTTAGTAT TGACAATATT 780

GGTCATATTG ACTAGAAAAT AGAGTCTATC AAAATTTAAA GGCTAATAGA GGTGATGAGA 840

CAATTTCGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG CTAAGCTCGA GAAAGGACAA 900

ATTTTGTCCT ATATTCAGAG CGATAAAAAT CCGTTTTTTG AAGTTTTCAA 960

AGTTTCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT GAGATTATTG GTCGCTTCCA 1020

GTTTGGCATT AGAATAGTGT AGTTGAAGGG CATTGACAAT CTTCTCTTTA TCTTTGAGGA 1080

AGGTTTTAGA GGATGAACTT GATTCAGATT GTCCTCAATG AGTCCGAAAA ATTTGTCAGG 1140

CTCCTTATTC TGAAAGTGAA AAAGCAAGAG TTGATAGAGA TTATAGTGGT GTTTCAAGTC 1200

TTCTGAATAG CTCAAAAGTT TATCTATAGT AGATTGAAAC TAGAATACTA CACCTCTGCT 1260

TCTAAAACAT TGTTAGAAAT CGATTTGACT GTCCTGAATG ATTTGTCCTG TTATTATTTC 1320

ATTTTACTAT AAATCCACGT TTACGAATCT CTTTCCACAC TTGTTCAATG GGGTTCATCT 1380

CTGGTGTGTA TGGAGGAATA AATGCAAAAC CAATATTAGT CGGAATCTTT AAGGTACTTG 1440

ATTTATGCCA TATAGCATTG TCCATAACGA GTAAAAGATA ATCATCTGGA TAAGCTTGTG 1500

AAAGCTCCTA TTCCTAAAGC CCCTTTATAA CCTCTTGCGA GAGAGACTAT TGAOTCAGCC 1560

CTTACTTCAT GCGGATGAAA CTTCTTATCG GGTTCTAGAG AGTCATAGCC ATCTGACCTA 1620

CTATTGGACC TTTTTGTCTG GCAAAGTTGA GAATCAAGCA ATCACGCTGT ACCATCATGA 1680

TCAGAGTCGG AGTGGTTCGG TAGTACAAGA ATTCCTAGGA GATTATTCTG GCTATGTTCA 1740

TTGTGATATG TTGCGGCAGT AACTTAGGAC TTTAGTCCTC TAGTTCTGCC TATGCGATAG 1800

CAGTCCAAGG TTTAGGAGCA AGGCGACGCT AAGCTTGGTA AACTGCGAAC CGCTAGAAGC 1860

TTATCGTCAA CTGGAAGAAG CTGAACTTGT TGGATGTTGG GCGCATGTGA GAAGGAAATT 1920
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TTTTGAAGCG ACCCCCAAGC AAGCAGATAA ATCATCCTTA GGAGCTAAAC GTTTAGCTTA 1980 

TTGTGATCAG TTATTTTCCT TGGAAAkACA CTGGGAGGCT TTGCCAGCTG ATGAACGACT 2040 

ACAGAAACGT CAAGAACATC TCCAGCCCCT AATGGAAGAC TTCTTTGCTT GGTGCCGCCG 2100 

TCAGTCAGTT TTAGCAGGTT CAAAACTAGG AAGGGCAATT GAATACAGCC TCAAGTATGA 2160 

AGAAACCTTT AAGACTATTT TGAAAGACGG ACATCTGCTC CTTTCCAATA ATCTAGCTCA 2220 

ACGCGCCATT AAATCATTGG TTATGGGACG GAGTAAAAGA GTCCAGTGGA CTCTTTTAGC 2280 

CTGAGCTCAG TTTAAAAAAG CGAGGGTGGT TATTTTCTCA AAGTTTTGAA GGAGCTAAAG 2340 

CAAGAGCTAT TGTTATGAGC TTGTTGGAAA CAGCTAAACG TCATCAATTA TAGTGCGTTG 2400 

AATCTATAAC AGTACGCATC GACTGCTAAA ACATTTCTAT AAATCAATTT TCCTTTCCTA 2460 

ATCGATTTGT TCATATCTTA TTTCAATCCA TTATAAATAG CGAGAAATAT CTATCCTATC 2520 

TTCTAGAATG TCTTCCAAAC GAGGAAACTC TCGTAAACAA AGAGGTTTTA GAGGTTTATT 2580 

TACCATGGAC TAAAGTTGTA CAAGAAAAGT GCAAATAAGA AATCTCCAGA TTAGGAACTA 2640 

TCCGTGAGTT CACTAATCTG GAGATTTTTC AATAGAtTCG TTATTGGGCG GTTACGATAT 2700 

GATcACTACT TCGTCAGTCT TATCTACAAC CTCAAAACAG TGTTTTGAGC AACCTGCGAC 2760 

TAGCrrCCTA GTTTACTCTT TGATTTTCAT TGAATATTAG AACAGAAAAA ATGCTTGGAG 2820 

TATTTGTTTG TGTGTTTATT TTTATATAAC AAACTATAAA CAAAATAAAA ATATAAAAAA 2880 

AGAGACAAAA AAGAACAGAA AGTAATTGAC A 2911
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

GAAAATAAGT CTTGACAGAA AGCGCTATCA ATGATAGAAT GAATTCAGAT AAAAAGATTT 60 

ATTTTTAAAA CAAAAATGAA ACGTTTCAAA AAAAGAAATA AAGAGACAGC GCCAAGCGCT 120 

ATCTTTTCTA GAAAAAAATG AAACGTTTCA AAAAAGGAGG TTGCTATGAA TAGCAAAGCG 180 

AAGCAAGTTT CTCTTTGGGA AAGAATCAAG AAACAAAAAC TCTTGTTATT GATGACTGTC 240 

CCCGGTTTAG TTTTAACCTT TATCTTTAAA TACATCCCTA TGTATGGGGT TTTAATCGCA 300 

TTTAAAGATT ACAATCCTTT AAAAGGAATT TTAGGGAGTG ATTGGATTGG TTTTTCTGAG 360 
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TTTACAAAAT TCATATCCTC TCCCAACTTT GGTATCTTGT TAGCCAACAC ATTAAAATTA 420 

AGTATCTATG GTTTATTGCT TGGCTTTTTA CCACCAATCA TTCTCGCGAT TATGCTCAAT 480 

CAACTCTTGA GTGAAAAAGT CAAAAAACGA ATTCAGCTCA TTTTATACGC ACCAAACTTT 540 

ATCTCAGTCG TTGTTATTGT CGGTATGATT TTCCTCTTCT TTTCAGTGGG AGGACCAATC 600 

AACAATTTTC TTTCTATGTT TGGAATGAAG GCTGACTTCT TGACAAATCC AGACTTCTTT 660 

AGACCTTTAT ACATCTTTAG TGGTATCTGG CAAGGAATGG GCTGGGCTTC AACGCTCTAC 720 

ACGGCAACAT TGGTAAATGT AGATCCAGCC TTAGTAGAAG CAGCCCGACT GGATGGAGCC 780

AATATCTTCC AACGAATCTG GCACATTGAT ATTCCAGCTC TTAAGCCTAT TATGGTTATC 840 

CAATTTGTTT TAGCTGCAGG TGGAATTATG AATGTCGGAT ATGAAAAAGC ATTCTTGATG 900 

CAGACATCGT TAAATTTGCC AACTTCTGAA ATTATCTCGA CATATGTCTA TAAAGTTGGT 960 

CTTGTATCAG GAGACTATTC TTACTCAACA GCGGTTGGTT TGTTTAATGC AGTGATTAAC 1020 

GTAGTATTGC TTGTTGCAGT TAACCAAATC GTTAAACGCA TGAATAATGG TGAAGGAATT 1080 

TAAGGAGGAA AGTATGAAAA ATTCGATTAT GGATACAAAA TTTGATAGAC GTATCTTACT 1140 

CTTAAATAAA ATCATTATTG TCTTTATCGT TTTGATGACT TTGCTTCCTT TACTTTATAT 1200 

CGTCGTAGCA TCCTTTATGG ATCCTAAGGT TCTGGTTAGT AGAGGGATTA GCTTTAATCC 1260 

AGCCGATTGG ACTGTAGAAG GTTACCAGCG TGTATTCAGT GACCAATCTA TTCTAAGAGG 1320 

TTTTATCAAT TCTCTACTAT ACTCTTTTGG ATTTGCAGCT TTAACAGTCT TGCTATCTGT 1380 

GTTTACAGCT TATCCTCTTT CTAAGAAAGA CTTGGTTGGA CGTCGTTGGA TTAACTACTT 1440

CTTGATTGTA ACTATGTTCT TTGGTGGTGG TTTAGTCCCA ACTTACTTGC TCGTAAAAGA 1500 

ATTGGGAATG CTCAATACTC CATGGGCTAT CATTGTTCCA GGTGCTGTTA ACGTTTGGAA 1560 

TATTATTCTT GCTAGGGCCT ATTTCCAAGG ATTGCCTGAA GAATTAGTTG AAGCTGCTGT 1620 

CATTGATGGT GCAAATGATT TACAGATTTT CTTCAAAATC ATGCTTCCTC TTGCAAAACC 1680 

AATTATGTTT GTTCTCTTCC TTTATGCTTT TGTAGGACAG TGGAACTCAT ACTTTGATGC 1740 

AATGATTTAT ATCAAGGATC CAAACTTGGA ACCATTGCAA CTTGTACTTC GTAAAATTCT 1800 

CATTCAGAGC CAACCAGGTC AAGACATGAT TGGAGCACAA GCGGCTATGA ATGAAATGAA 1860 

ACGTTTAGCT GAATTGATTA AATACGCAAC TATTGTCATT TCCAGCTTGC CATTGATTGT 1920 

TATGTATCCA TTCTTCCAAA AATACTTTGA TAAAGGAATT ATGGCTGGTT CACTTAAAGG 1980 

ATAAAAAAAG AAAAAATAAA AGGAGTTTTC TCATGAAATT CAAAACATTC TCAAAATCAG 2040 

CAGTTTTGTT GACAGCTAGT TTAGCACTAC TTGCAGCCTG TGGCTCAAAA AATACAGCTT 2100 

CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT TCAAGAAAAG AAAACATTGA 2160 
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AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC AAATGAAAAG TTAATTTTGC 2220 

AACGTTTGGA GAAGGAAACT GGCGTTCATA TTGACTGGAC CAACTACCAA TCCGACTTTG 2280 

CAGAAAAACG TAACTTGGAT ATTTCTAGTG GTGATTTACC AGATGCTATC CACAACGACG 2340 

GAGCTTCAGA TGTGGACTTG ATGAACTGGG CTAAAAAAGG TGTTATTATT CCAGTTGAAG 2400 

ATTTGATTGA TAAATACATG CCAAATCTTA AGAAAATTTT GGATGAGAAA CCAGAGTACA 2460

AGGCCTTGAT GACAGCACCT GATGGGCACA TTTACTCATT TCCATGGATT GAAGAGCTTG 2520 

GAGATGGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC TTGGATTAAC AAAGATTGGC 2580 

TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA TTTGATTAAA GTCCTAGAAG 2640 

CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA TGAAATTCCA TTTTCATTTA 2700 

TTAGTGGTAA CGGAAACGAA GATTTTAAAT TCCTATTTGC TGCATTTGGT ATAGGGGATA 2760 

ACGATGATCA TTTAGTAGTA GGAAATGATG GCAAAGTTGA CTTCACAGCA GATAACGATA 2820 

ACTATAAAGA AGGTGTCAAA TTTATCCGTC AATTGCAAGA AAAAGGCCTG ATTGATAAAG 2880 

AAGCTTTCGA ACATGATTGG AATAGTTACA TTGCTAAAGG TCATGATCAG AAATTTGGTG 2940 

TTTACTTTAC ATGGGATAAG AATAATGTTA CTCGAAGTAA CGAAAGTTAT GATGTTTTAC 3000 

CAGTACTTGC TGGACCAAGT GGTCAAAAAC ACGTAGCTCG TACAAACGGT ATGGGATTTG 3060 

CACGTGACAA GATGGTTATT ACCAGTGTAA ACAAAAACCT AGAATTGACA GCTAAATGGA 3120 

TTGATGCACA ATACGCTCCA CTCCAATCTG TGCAAAATAA CTGGGGAACT TACGGAGATG 3180 

ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA TAGTCTAAAA CACTTACCAC 3240 

TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA AGTAGGAGGA CCACTAGCTA 3300 

TCCTAGATTC ATACTATGGT AAAGTAACAA CCATGCCTGA TGATGCCAAA TGGCGTTTGG 3360 

ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT CAATAACTAT CCAAGAGTCT 3420 

TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA AGCAGATATG AATGACTATA 3480 

TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT TGATACTGAG TGGGATGATT 3540 

ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT CGCTATTAAA CAAAAATACT 3600 

ACGACCAATA CCAAGCAAAC AAAAACTAGA GGTTGATTAT GGGAGATAAG AAATACACAG 3660 

TAGAAAAAGC CAATCGTTTT ATAGCAGAAA ATAAACATCT CGTTAATACT CAATATAAGC 3720 

CTGAAGAACA TTTTTCAGCT CAGATTGGTT GGATCAATGA TCCAAATGGA TTTGTCTATT 3780 

TTCGTGGAGA ATACCATCTC TTTTATCAAT TCTATCCATA TGATAGTGTT TGGGGGCCTA 3840 

TGCACTGGGG ACATGCTAAA AGTAAGGACT TGGTGACTTG GGAGCACTTG CCAGTGGCAC 3900

TTGCTCCTGA CCAAGATTAT GACCGAAATG CTTGTTTCTC AGGCTCTGCC ATTGTCAAGG 3960

ATGATCGCCT CTGGCTCATG TACACTGGAC ATATCGAAGA AGAAACCGGT CTCCGCCAAG 4020

TGCAAAATAT GGTATTTTCA GATGACGGGA TTCACTTTGA AAAGATTTCC CAAAATCCAG 4080

TTGCAACTGG ATCAGACTTA CCAGATGAGT TGATTGCTGC TGATTTCCCT GATCCAAAAC 4140

TCTTTGAAAA AGATGGACGC TATTACTCCG TAGTAGCTGC CAAACACAAG GATAATGTGG 4200

GCTGTATCGT TCTACTAGGG TCCGATAACC TAGTAGAATG GCAGTTCGAA TCCATCTTTT 4260

TAAAAGGGGG AGAACACCAA GGTTTTATGT GGGAATGCCC AGATTACTTC GAGTTAGATC 4320

GGAAAGATTG CCTTATTATG TCACCCATGC GTTATCAGCG TGAGGGAGAC TCATATCATA 4380

ACATCAACTC ATCGCTTTTG TTCACGGGTA AGGTAGATTG GAGAGAAAAA CCTTTTATCC 4440

CAGAATCAGT TCAAGAAATT GATCATGGCC AAGACTTCTA TGCGCCTCAA ACATTGTTGG 4500

ACGATCAAAA TCGTCGTATC CTGATTGCTT GGATGCAGAC ATGGGGGCGT ACCCTTCCAA 4560

CCCATGACCA AGAACACAAG TGGGCATGTG CCATGACTCT ACCTAGAATT CTAAGATTGG 4620

AAGATGGCAA ACTAAGACAA TTCCCTGTTA AAAAAGGCCA ATATCAAATC CAAATAGATA 4680

AAGATTGTCA TTACCACTTA GGAAATGATA TAGATTATCT TGAATTTGGT TATGACAGTA 4740

ATGCGCAGCA AGTTTACATT GATCGTAGCC ATCTTATTCA AAAAATTCTA GGTGAAGAAG 4800

AACAGGACAC TAGTCGACGG TATGTAGATA TTGAAGCTAA AGAATTGGAA GTTGTTCTAG 4860

ATAAAAATTC CATCGAGATT TTTGTCAATC AAGGTGAAGC AAGCTTGACT GCAACTTATT 4920

ACTTAACGGT GCCAGCTGAG CTATCACGAA TTGATTAAAA ATTAAGTTAT 1TCTCCTAAA 4980

GAAAAAGTTC TCTTTCTAAA ATAGTGGAAA GAGGACTTTT TGTGTTTTGG GTATATAAGC 5040

TTAGTTTATG GTATTTGTAA AATTCGTCTT GGATTATGAT TTAAGCTACT TTTCTAAAGA 5100

ATTTGAAAAA AATTTTATTT AAGCAAAAAA ACCTTGGTTC CAAGGCTTTT CCTGTTGTAT 5160

TTAGATGCCC CCTACAGGGA TTGTAGGAGA TATGTTGCTT AGATGTTCTT GATTTTCTGG 5220

TGTTTTGTAA CGTTTAAATG AGTTTTTTGA GTTTGTTGGT GGGGCGTTGC CCGGCAATTG 5280

CCCGACTTAT TGCTTGAAAA AGAATTTAAA ATATAGTATA GTTAATTATA GATTAACACT 5340

TGCTTGGAGG AACTGATGAA GAACAATGAA AGATTAGGTA TTAAATTAAG TAGAGATAGC 5400

GTTTTAGGAT TGAGGGAAGT TAGAAGGCTT TATTTAGGCA GTTCAGATAT CCCAGTTTCT 5460

GATGGCTATG TGATTGAAGT TGCTTATAAC CAGATATCAC ATGAGATTGA TATTATTGAT 5520

TGGGTAGAGT TGAACAAGTC AAAAATTAAG ATAAGTGAAA TTAGTGAAAG CGTGGATATA 5580

GATGCCACTA GCTTGAGAAC AACTTTGACT TTAGACACAT TAGTATATGA AGGTATGAGA 5640

GATATACAGT TAAAGTTGAG AGAGCTTACA AAGGGGAGAG TATTCTTTTC ATTTGTAGTG 5700



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AAGTTAGTTT TGTTTGCTTC TATTTTAAAG AAAAAAGATT TACTAGAAAA ATTTCAAGAA 5760 

AAGTGTTAAT CAAGTATTGA CACTTTATCT GGATTTCCGT ATAATATGCT TAGAAAGGAA 5820 

TCTTTCTAAA TTTTTTTCGT CCTTATGTGT TAATCAAAGA CGAATACAAA AACATATTTT 5880 

TTTACTCTAA AAAGTGTTAA TCAATGATGT ATTTGTTAGA GAGGTAGATA AATGGAATTG 5940 

AGAGCACCAC CAGTTATAAT AGTATAAAAC GTATAATAAA AATATTTTAA CTTGAATTAT 6000 

AGAAAAGGAG AAACAAATCA TGAAACAAAA ACAACCGATT GTTTCTAGAA CCAAACAACA 6060 

TACATTTGAA GAGCTTATTC AAGACCAAAA GTTAGAAAGA TTGGCTAAGT TGTCGCCCGA 6120 

TTTGGTTGGA AGGTATGGTT TTACTGCTAG CTGTGCGTCT TCATTTGCGA ACTTGATTAA 6180 

AGAAGCGTAT GGGGGTAAAA ATCTAAACGT AGTTTATGCG AGTCGGATGT TGGCTCTCTG 6240 

GAATATTGCT TGCAGTTGTT ATCATAAGGC TGATGGGTAT TCTTTAGCAG ATGCGCTTTT 6300 

TAGTGATAAA AAAATTTGTC TAGATTCTTA CTATTACCAC AAGAATACCT CTAATACCAT 6360 

AACTAGTGAT GTGATAAAAG ATGTTTACGA TAATTATAAT AATTATATGG TTTTAACTCG 6420 

AGAAGCGACA CCTGAATACA TTTATGTTGT ACAAACTGAA ATGCCAAAAG ATTCAGATTT 6480 

ATATTTTTAT ATTAGAGAAG TTCTGGGATT ATCGTTTAGT ACCATGCATT ATGCATTTTT 6540 

AGTCAAGGTT CTTGCAGGAG CGCTTGCTAG AAAATATAAG CCATATCGAA ATTGAATTAT 6600 

TTAAATTTAT ACTCTTCGAA AATCAAATTC AAACCAAGTC AGCTTCCCCT TGCTGTACTC 6660 

AAGTGCTGTC TGTGGCTAGC TTCTTAGTTT GCTTTTTCAT TTTCATTGAG TATTACTCTT 6720 

ATGGTAGTTA TTTATGGCAT AATAATATTG ATTTGGGAGT TATAGCGAAA ATTTTAGGTT 6780 

CTATAATATT TGTAGTGGGT AAACCACTAT AGATATTATG GAGCCTATTT ATTGTAGAAA 6840 

AAAGTCCCAT ATGA 6854 
(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3895 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:

TCCTTGCTAA GTTTATACTC AATGAAAATC AAAGAACAAA CTAGGAAGCT AGCCACAGGT 60 

TGCTCAAAGC ACCGCTTTGA GGTTGCAGAT AAAACTGACA CGGTTTGAAG AGATTTTCGA 120 

AGAGTATTAA TTTACATAAA TAGCCAGTGT TTGATAGGGT TTGAGTAGAA TTTTCTCAGA 180

CACTTCTGCA TCTTCATAGT TTGATATCAA AATCTGTCCA TTTTGGTAGA CTGCTGGCAA 240

GTCGATTTcA CTTCTTTAGC ATAAAAGTTA TTGAGCACTA GTAACTTTTG ATCCTCAAAC 300

TGGCGTTCAA AAGCGTAGAC TTGTTTGCTA TCTTCAAAGG CTGGTTTGTA ACTTCCTTCT 360

GAAATGATTG GCATTTCCTT ACGCATCGAA TCAAGTCTTG ATAGAAGGTA AAAATCGGAC 420

CCTCGATTTC ATTTTCTACA TTGATGTATT TATAGGATTT ACCAGCTTTC AACCAAGGAG 480

TGCCTGTTGA AAATCCTGCA TTTTCCGAAG CATCCCACTG CATGGGAATG CGTGAATTAT 540

CACGCGACTT AGCTTGAATA ATCTGGAAGG CTTCTTGCTG ACTCTTTCCT TCTTCTAAGA 600

GCATCTGATA GGCATTAAGC GATTCGACAT CCACATAATC AGCCATAGAA TCATAGTCTG 660

GGTCAATCAT CCCGATTTCC TCACCCATGT AGATATAAGG TGTCCCACGT GACAGGTGAA 720

TGCTGGCTGC TAGCATGGTG GCTCCTTCCT TGCGGAAGTT TTGAATATCG ACAAAACGGT 780

TCAAGGCACG TGGTTGATCG TGATTATTCC AAAAGAGGGC ACTCCAACCG TCTTTATCAC 840

TCATTTCCTT ACCCCAACTA TGGTAAAGAC TCTTCAACTC TTCAAAATCA AAGGGAGCCA 900

AGGTCCACTT TTGTCCATCC TTATAGTCCA CCTTGAGGTG ATGAAAATTA AAGGTCATGG 960

ATAATTCCTG ACGATCAGGC GACGAATAGA GGACACAGTT TTCCATGGTG GTAGAAGACA 1020

TTTCCCCAAC TGTCATAAAG CTATCGTCGG ATCCAAAAGT GGCTTGGTTC ATCATACGCA 1080

AATAGTTATG AACGATGGGT TTGTCTGTAT AAGCTGGCTT CCCTTCATTT TCAGGACAGT 1140

CCACTGAAAC CTCGTCCTTA CCGATCAAAT TGATCACATC AAATCGGAAA CCTTTGACAC 1200

CCTTGTCGCG CCAGAAATTA ACAACCTTGA AAAGCTCCTT ACGGACATTG GAATTGCGCC 1260

AGTTAAGGTC AGCCTGGGTC TCATCAAATA GGTGAAGATA GTATTTCCCA GTATCCCCGA 1320

AAGGCGTCCA TGCAGAACCA CCAAACTTAG ACTGCCAATC TGTTGGTTGG TCTTGGATGA 1380

AGAAAAAGTC TTGATAATAC TTATCACCAG CTAGGGCTTT CTGAAACCAT TCATGCTCTG 1440

TCGAACAATG ATTAAGTACC ATGTCCAGCA TAAAGTCAAT CTTGTGCTCT TTACCGACAC 1500


ACACCATTTT 


CTCAAAATCA 


GCCATATCAC 


CAAAAAGAGG 




r\ k t\i rtJ-v i A_ i\r 




AAATATCGTA 


ACCATTATCC 


CGTTGAGGGC 


TTGGATAGAA 


TGGATTGAGC 


CAGACCATAT 


1620 


CCACACCTAG 


TTTGGCTAAA TAGGGAATTT 


TTTCGATAAT 


CCCACGGAAA 


TCCCCAATAC 


1680 


CGTTTTCAGT 






GATAGATTTG 


ATAGACTACT 


TTTCCTTTAT 


1740 


CAAGTGTCAT 


CTGTTTCTCC 


TTTTCTGATA 


AAAGGGAGGA 


AGCAGTCTTC 


CGTCCCTATT 


1800 


TGTGCTATTT 


CAATTATACT 


CAATGAAAAT 


CAAAGAACAA 


ACTAGGAAGC 


TAGCCACAGG 


1860 


TTGCTCAAAA 


CACTATTTTG 


AGGTTGCAGA 


TAGAGCTGAC 


GTGGTTTGAA 


GAGATTTTCG 


1920 


AAGAGTATTA 


GATTCGTGTA 


GCGACCATGA 


GAGATGCTCC 


AGCTTGGATC 


GTTGTCUGAT 


1980 
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AAGTTCCGGG AATAGTCGCT GTATAAGCAT CTTGGTTGCT GATGATAACA GGAGTTTCTG 2040 

TCACCAGACC TGCACCCTTA ATGACATCCA TATCAAAACG AATCAGTTGC TGACCAACTG 2100 

TAACGTGATC TCCTTGGACT ACAAGACTTT CAAAACCTTT CCCATCAAGA CCTACTGTAT 2160 

CCATACCGAT GTGGATGAGC AATTCAACTC CCTCGTCAGA GACAATGCCG ATGGCATGCT 2220 

TGGTAGGGAA AAGAACCGTC ACTGTCCCAT TAACTGGAGA GGTCAACTCA CCTTGGCTTG 2280 

GTTCAATGAC TAGACCTTGC CCCATGACAC CTGATGCAAA AATACGATCC GTCGCTTGAC 2340 

TCAATTCTTT CACTTGGCCA GTTACTGGGC TGATAATTTC TACCGAAGTA AGTTCTACTG 2400 

GTTCATGGTT CACAAATTCT GCTTCTTCTT GAGCAACGAA TTCTGCCTGC AAGTTCGTAT 2460 

CGCCCTCTGT TTTTGTAAAG AGACCAGCCT TGCGGAAGAA GAAAGTCAAG ACCATTGGAA 2520 

CAACAATCGC AACTAGCATA GTTCCTGCAA ATGGCAGCAT GTATTGAGGT TGAATAGAGA 2580 

GAATACCTGG CAAACCACCG ATACCAATAG AAGCCGCAGT TACATTAAAA GTAACGGATA 2640 

ACATGCCTGC AAGGGCTGAA CCAGTCATCC CAGCAACAAA TGGATAAATA TATTTTACGT 2700 

TAACCCCAAA AAGAGCTGGT TCTGTAACAC CGAGATAGGC TGAAATGGTT GCAGGAAGTG 2760 

AAACCTGAGC CTCACGCTCA TCATGGCGAT GCATGAAATA ATAGGCAAAC ACGGCTGAGC 2820 

CTTGAGCAAT ATTAGAAAGA GCAATCATTG GCCATAGGGC AGTGCCACCA GCATCCGCAA 2880 

TCAATTGTGT ATCAATGGCA TTGGTCATAT GGTGCAGACC TGTGATGACA AATGGAGCGT 2940 

AGAGGGCGCC AAAAATTGCA CCGAAGAGCC ATTTAACTGG ACCAGTTAAA CCTGCCAAGA 3000 

CAACTGATGA AAGTCCTTGT CCAATTGTCC AACCGATTGG TCCCAAAACA GTATGAGCCA 3060 

AAATCAAGGC TGGAATCAAT GACAAGAAAG GTACAAAAAT CATAGAAATG ACTTCTGGGA 3120 

TATGCTTGTG CCAGAAGATT TCAAGATAAG ACAGACTCAA ACCTGCAAGC AAGGCTGGGA 3180 

TAACTTGGGC TTGGTAACCG ATACGATTAA CAGTAAAATA GCCAAAATTC CAAACCCAGT 3240 

TTGCCGCGAT ATCAGCTGCT GGCGTTGAAG CAACCGCATA GGCATTGAGC AACTGAGGTG 3300 

ATACCAAACA GATTCCGAGA ACAATTCCCA AAATTTGGCT GGTTCCCATC TTACGAGAAA 3360 

CAGACCAAGT AATCCCTACT GGTAAGAACT GGAAGATACC TTCACCAGGC AACCAGAGGA 3420 

AGTGATTGAC ACCTGCCCAA AACTGAGAGG ATTCTGTGAT GGTCTTGCCA TCCAACATCG 3480 

ACCAATGGAC ACCTTCCAAG ACATTACGGA AACCGAGGAT CAATCCTCCG ACTATCAAGG 3540 

CTGGAATAAT CGGAGTAAAA ATCTCCGCCA GAGTGGTCAT AACACCTTGG ACCACGTTTT 3600 

GATTACTCTT AGCTGCAGAC TTGGCTGCTT CTTTGGAAAC ACCCTCAATA CCTGAAACGG 3660 

CTGTAAAATC ATTATAAAAG ATGGGCACGT CATTTCCAAT GATTACCTGA AATTGACCTG 3720

CATTTGTAAA GCTTCCTTTA ACAGCTGGAA TTGACTCGAT AGCTTTAACA TTAGCCTTCT 3780

TATCATCTCC TAAAACAAAC CGCATCCGTG TCGCACAGTG ACTTACGGCA GTCACATTTT 3840 

CTTTGCCTCC GATTGCCTGA AGCAGATCTT TGGCTTCTTG TTCAAATTTT CCCGG 3895 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3936 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:



AGGATCGCCG CTCCAGCTAC TAAGTCTCGT GCAGTGCCGA TTTATCAAAC AACATTTTTT 60

GTTTTTGATG ACACGTAGGA AGGTGCCGAT CTGTTTGCCT TGAGGAAACC AGGGAACATT 120

TATACTCGTA TCACCAATCC TACAACAGCT GCCCTTGAAG GTGGTGTTGA AGCGCTAgcA 180

ACAGCATCAG GTATGACTGC AGTGACTTAT ACGATTTTGG CGATTGCCCA TGCTGGTGAC 240

CATGTAGTGG CTGCTTCGAC TATTTACGGT GGAACCTTCA ATCTTTTGAA AGAACCCCTT 300

CCTCGTTATG GTATCACAAC AACCTTTTTC GATATTGATA ATTTGGAGGA AGTAGAAGCA 360

GCTATCAAAG ACAATACCAA GCTTGTCTTG ATTGAAACCT TGGGTAACCC CTTGATTAAT 420

ATTCCAGACC TGGAAAAACT GGCAGAGATT GCTCATAAAC ATCAAATCCC ACTTGTGTCA 480

GACAATACTT TTGCAACACC TTATTTGATT AACGTCTTCT CTCATGGCGT TGACATTGCC 540

ATTCACTCTG TGACTAAGTT TATCGGTGGG CATGGTACAA CTATTGGAGG AATAATTGTC 600

GATAGTGGTC GTTTTGACTG GACGGCTTCA GGGAAATTCC CTCAATTTGT TGACGAGGGT 660

CCAAGCTGCC ACAATTTGAG CTATACTCGT GATGTGGGTG CAGCAGCCTT TATTATAGCT 720

GTTCGAGTTC AATTGCTTCG TGATACAGGT GCAGCCTTGT CACCATTCAA TGCTTTCCTC 780

TTGCTACAAA GACTTGAAAC CTCTTCACTT CGTGTGGAAC GCCATGTACA AAATGCTGAG 840

ACAATTGTTG ATTTTCTTGT CAACCATCCT AAGGTAGAAA AGGTAAATTA TCCAAAACTT 900

GCAGATAGTC CTTATCATGC CTTGGCTGAG AAATACTTGC CAAAAGGTGT CGGTTCAATC 960

TTTACCTTCC ACGTCAAAGG TGGCGAGGAA GAAGCACGCA AGGTCATTGA TAATTTAGAA 1020

ATCTTTTCTG ACCTTGCAAA CGCGGCAGAT GCTAAATCGC TTGTTGTCCA TCCAGCAACA 1080

ACCACTCACG GTCAATTGTC AGAAAAAGAC CTAGAAGCAG CAGGTGTCAC ACCAAACTAA 1140

ATTCGTTTGT CAATCGGTCT TGAAAATGTA GAAGATTTGA TTGAAGACTT GCGCTTGGCC 1200

TTGGAAAAAA TTTAAAGTAA AAGAAGATAA ACAGTGGGCT TCGACTCACT GTTTTTGATT 1260

TTCCCTCAGG CATGATATAA TGGTTACAGA AGTCTAGAAA GAGGAACGAT ATGAACGAAA 1320

TCAAATGTCC CAACTGTGGG GAAGTCTTTA CAGTAAATGA GAGTCAGTAT GCCGAACTCT 1380 

TGTCCCAAGT GAGAACGGCA GAGTTTGATA AGGAACTACA CGATAGGATG AAGCAGGAAC 1440 

TGGCCTTGGC TGAGCAAAAG GCCATGAATG AGCAACAGAC TAAACTGGCT CAGAAGGATC 1500 

AAGAAATTGC GCAATTACAG AGTCAGATCC AAAACTTTGA TACAGAAAAA GAATTGGCCA 1560 

AGAAAGAGGT TGAACAGACA AGCCATGAGG CTCTCTTGGC TAAGGACAAG GAAGTACAGC 1620 

TCTTAGAAAA TCAGTTGGCT ACCTTCCGTT TGGAGCATGA AAATCAACTA CAAAAGACCC 1680 

TTTCTGACCT AGAAAAAGAA CGGGATCAGG TTAAAAACCA ACTACTTTTG CAGGAAAAGG 1740 

AAAATGAATT ATCTTTGGCT TCTGTTAAGC AAAACTACGA AGCCCAGCTC AAGGCAGCTA 1800 

GTGAACAAGT CGAGTTTTAT AAGAATTTTA AGGCTCAACA ATCTACAAAA GCGATTGGGG 1860 

AAAGCCTAGA ACAGTATGCA GAGAGTGAGT TTAACAAGGT TCGTAGTTTC GCCTTTCCAA 1920 

ATGCTTACTT TGAGAAGGAT AACAAGGTCT CTTCGCGTGG GTCTAAAGGG GACTTTATCT 1980 

TCCGTGAGTG TGATGAAAAT GGAGTTGAAA TCATTTCTAT CATGTTTGAG ATGAAAAACG 2040 

AAGCGGACGG AACAGAGAAG AAGCACAAGA ATGCAGATTT TTACAAGGAA TTGGACAAGG 2100 

ACCGTCGGGA GAAGAACTGT GAGTATGCCG TTTTGGTGAC CATGCTTGAG GCTGATAATG 2160 

ACTACTTTAA CACAGGGATT GTTGACGTCA GTCACGAGTA TGAAAAAATG TATGTTGTTC 2220 

GTCCTCAATT CTTTATCCAA TTGATTGGTC TCTTACGTAA TGCGGCGCTA AATTCCCTAA 2280 

AATACAAGCA GGAGTTGGCC TTGGTTCGCG AGCAAAATAT TGACATTACG CATTTTGAGG 2340 

AAGATTTGGA TGCCTTTAAG CTAGCTTTTG CTAAGAACTA TAATTCAGCT TCGACTAACT 2400 

TTGGAAAAGC TATTGATGAA ATCGACAAGG CCATCAAACG CATGGAAGAG GTTAAGAAAT 2460 

TCCTGACCAC ATCTGAAAAC CAACTCCGTT TAGCTAACAA CAAATTGGAA GATGTCTCTG 2520 

TTAAAAAATT GACCCGGAAA AATCCAACAA TGAAAGCGAA GTTCGAAGCA CTGAAGGGGG 2580 

AGTAGAAAGC AAAAATGAAC GGTATTATTA ACTTAAAAAA GGAAGCAGGA ATGACCTCGC 2640 

ATGATGCGGT TTTTAAACTG CGTAAGATTT TGGGAACCAA GAAAATTGGT CATGGTGGAA 2700 

CCTTGGATCC GGATGTGGTG GGTGTTTTGC CGATTGCGGT TGGCAAGGCG ACACGCATGG 2760 

TCGAGTTTAT GCAGGACGAG GGTAAGATCT ATGAGGGGGA AATCACTCTG GGCTATTCCA 2820 

CGAAGACTGA GGATGCTAGT GGGGAAGTGG TCGCAGAAAC CCCTGTTTTG TCTCTCTTGG 2880 

ATGAAAAGCT TGTTGATGAA GCGATTGCTA GCTTGACTGG GCCTATTACT CAGATTCCCC 2940 

CTATGTATTC GGCAGTTAAG GTTAATGGTC GCAAGCTCTA TGAGTATGCG CGTGCTGGTC 3000 




AGGAAGTGGA GCGTCCAGAA CGTCAGGTGA CCATTTATCA ATTTGAGCCA ACAAGTCCGA 3060 

TTTCTTATGA TGGCCAACTT GCCCGATTCA CTTTTCGTGT AAAATGCAGT AAAGGGACGT 3120 

ACATCCGTAC TTTGTCAGTT GATTTGGGTG AAAAGCTTGG TTATGCGGCT CATATGTCCC 3180 

ATTTGACTCG TACTAGTGCT GCTGGCTTAC AATTAGAAGA CGCTCTTGCC TTGGAGGAAA 3240 

TTGCTGAAAA AGTAGAGGCT GGGCAATTAG ATTTTCTCCA TCCTTTAGAG ATTGGGACAG 3300 

GTGACCTTGT CAAAGTTTTC CTAAGTCCAG AAGAGGCTAC AGAAGTTCGC TTTGGTCGTT 3360 

TTATTGAGCT AGACCAAACG GACAAAGAAC TGGCTGCCTT TGAAGATGAT AAATTGTTAG 3420 

CCATTCTAGA AAAACGGGGC AATCTCTATA AGCCAAGGAA GGTTTTTAGC TAGATCCTTT 3480 

AGGAATAAAA ATCGGGTGAT AGATAACAAT TGCTTGATAA AACCCCATAC TAATAGTAGA 3540 

ATGGTTTTGG GAATTATAAT ATTCCAATTG TTGCGAGTTG TAGGTACTCA AATAATCTAT 3600 

ATAGAAATTT AGAGGTGTGA AATGAAGCAA TTTAAAATTC TTTCAGATAA ATATTTAGAG 3660 

TCCATTACAG GTTCTGATGG GAACTTAGGC CCAGGATTTG GTGTGATAAT TCCATGATGC 3720 

GAAATGAGTT TCGAGAAAGG GTGGAGCAAC TTCTTCAACA AAAAGAAATA AATGAAAATA 3780 

GTGAGTTGAG TCACCTGTTT CGTCTTGCTA TACAAAATTT AGACAGAAAT GAAAAATACC 3840 

AATCGGTCAT GGCCAATTTG AGTCAAGGGT TGTCACTTTA CCTCATGACG CATCATTACC 3900 

AGGCACCTAA GTCTGTCATT GATTTTGGTT TATGGA 3936 
(2) INFORMATION FOR SEQ ID NO: 203:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3230 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

CATCCAGCAA CTGCTCCTCT GAGCGTTTCA AAATTGATGT AATTTTTCTA GTTTTTTCTA 60 

ATAAATGTGC CATTTTTCAC CTCGAATTTA ATCGCTATCA TTATAACATA AAAACGTCTC 120 

TTTTTCAATA ATTATCTGAA AATTCCTTAT TGACTTGCAT TGACTTACAA TTTAATTAAA 180 

AACCAGAATA TTTTTAATTA AATTGTTCCT TTTCTATTGA CAAGTTGCCT ATTTTTGTGT 240 

ATCATAATAT TATAAAAGAT AATATAATAA TTTTATTTGT CTTTTCACAT TCGGTCTCCT 300 

TATATAAAAA AGCGATTCAT TTTGAACCGC TTTTTCTTAT TTATCGCCTT TGTTACGAAT 360 

AACAAAGCCT GTTTGCTTTT CGCTTAAAGT ATTGCGTGGT TTTTTATTAT CCTTACGGTA 420 

ACGTTTTTCC TTATCAAAAC GATCGTTGCC ACGACTTCCT TTTTTGAACT CATCACGGCG 480

ACCATTCCCA CGGCGATCAC GCTCTCGACG GTCGTCCCCA CGACGGCCTC CACGACCTCC 540

CTTAGCTTTA CCACCGAAAC CATTACCTGA TGGTTTAAAC GGTAGTGGtT TTTCACGTGC 600 

AATCTCCACT TCTGGAAGGC TATCTGGGTC TTGGACTGTC AGACTCAAGA TATACATTGC 660 

CAATTCTTCT GGAGTAAACT CAGCAGCCAA TTTGCGAGCA TCCTTACCAA ATTTCTCAAA 720 

GTTGGCACGA ATGGTTTCAT CTGCAAAATC ACGTTCGATT TTCTTGAGAG CTACCTGTTT 780 

TTTTGATTGG AACGATTCTT CTACACTTGC AGGTTTGAGA CCTTTCATGC GTTTCTTAGT 840 

CAAGTTTTCA ATGATTTGAA GGTAACCCAT TTCGTTTGGA GCAACAAAAG TAATAGATTG 900 

ACCTGACTTA CCAGCACGAC CTGTACGACC GATACGGTGA ACATAACTCT CAGGATCTTG 960 

TGGAATATCG TAGTTGTAGA CATGGGTCAC ACCTGAAATA TCCAAACCAC GCGCTGCAAC 1020 

GTCTGTCGCA ACCAAAACAT CAAGATTGCC ATTTTTAAAG TCACGAAGGA CACGAAGACG 1080 

TTTGTTTTGG TCTAGGTCGC CATGAATTCC TTCTGCACGG AAGCCACGAA TTTTCAAACC 1140 

ACGAGTCAAT TCATCCACAC GGCGTTTGGT ACGACCAAAT ACAATAGCGA GTTCTGGTTG 1200 

TGCCACATCC ATGAGACGAC TCATGGTGTC AAATTTTTCT TGTTCCTTAA CACGGATATA 1260 

GTACTGGTCA ACCAATTCTG TTGTCAATTC CTTAGCCGCA ATCTTGACAT GTTCAGGGGC 1320 

TTTCATAAAC TGAACACCGA TACGTTTGAT GGCATCTGGC ATAGTTGCTG AGAAAAGCAA 1380 

AGTTTGACGG TTCTCAGGTA CACGGGAAAT AATGGCTTCG ATGTCTTCAA GGAAGCCCAT 1440 

GTTAAGCATT TCATCCGCTT CGTCAAGGAT AAGGGTTTCA ATGTCTTGTA ATTTCAAGGC 1500 

CTTGCGTTTA ATCAAGTCCA AGAGGCGACC TGGAGTTCCC ACCACAATAT GGGCACCAGA 1560 

TTTAAGAGCC TTAATTTGTT TTTCAATGCT TGATCCGCCA TATACTGAAC GGACTTTGAC 1620 

TCCCTTACTA CGACCAAAGC GGAAGAGTTC TTCTTGACTT TGGACAGCTA GTTCACGAGT 1680 

TGGAGCGATG ACCAAGGCTT GGATAGTCGC TTCTTCTGTA CGGATTTTTT CAAGGGTAGG 1740 

CAAGCCAAAG GCTGCAGTTT TTCCTGTACC AGTCTGAGCT TGACCGATAA CATCCTTGCC 1800 

TTCAAGGGCC AAAGGAATAG TTTGTTCTTG GATAGGACTA GCTTCTACAA AACCAGCTTT 1860 

TTCAATTTCT GCTAGCAAAT CAGCAGACAA GTTTAATTCA TTAAATTTCA CGTTATTCTT 1920 

CTTTCTAAAG GTGGTGCGAA GCCACCCTAT AGGGCTTAGT TTATACTTTT CTTTTTATGA 1980 

CGTATTTTCA TATAACTAGA TATAAAATCG TGTTGCTTCT TTTCCACAAA AGAAAAGTAC 2040 

TGTTTTCTTT GCAACCTATC TAGTATAACA CAAGACCAGA GCAAAAGATA GCCCCATTTC 2100 

TACAGAAAAT CATGTAAGCG CTTTTTGACT TTCTTTTTTG ATTGAACGAC CTAGATAATA 2160 

AGACAAAGCC AAGGCGATAC TGTATAAAAT GAGAAAAACG AACAAGGTTT GTGTGTACGA 2220

ATGAGCCATT TTATAAGTCT CTGCTAATAA AATAGGTCCC GCTAAACCAG CCATTGCCCA 2280

AGCTGTTAAA ATATAACCAT GCAGAGCGCC CAATTCCTTG CTTCCAAAAA TATCACTGAG 2340

ATAAGCTGGA A7CAAAGAAA AACCAGCTCC ATAGCAAGTC ATCAAAATAG ACATAGCAAC 2400

TACAAATAAA ACGGAATCTG TAAAGAGCCA AAGTGAGAGA GAAAAGAAAA GATTGACAAG 2460

CAGTAATATA CTAAAGGTTA GAGGGCGACC GATATAGTCA GACAAACTCG CCCAGAGCAA 2520

GCGACCAAAT CCATTGAAAA TCCCCAAAAC ACCCACCATT ACTGCTGCAT GACTTGTAGA 2580

CAAGCCAGCC ATCTCCTGTG CCATTGGCGA TGCCGCTGAA ATTAAGCCTA AACCACAAGC 2640

TATGTTGATA AAGAAAATAA TCCAAAGCAT ATAAAACCGA TTGCTTTTTA GAGCCTGATT 2700

TGCAGCCATT CCTTGCGTCA AAGAGGCTGT TTTTTCTTTC CCTGAAGAAG ATAAAATTGC 2760

1 TCAl I rGGAC GCTTAATGAA TTGTGAAGCT AGGAGCATGA TAATAAAGTA 2820

ACTTGCTCCT AAAATATAAA AAGTTTCTAC AAGCCCTACC CCTGCGATGA GGTGTTGCGC 2880

TATGGGACTA GTCAATAAAG AAGCAAAACC AAACCCCATA ATCGCTAAAC CTGTTGCGAG 2940

ACCACGTTTA TCAGGAAACC ATTTTATAAT CGTCGACACA GGGGTAATAT AGCCTGCTCC 3000

CAAACCAAGC CCACCTAAAA TGCCATAAGC GAGATACAAC AACCACAGCT CTGACGGTCT 3060

ATTGCAAATC CTGTTAAGAT ATTTCCACCT GCGTATAGAA AAGCAGATAG ACTTCCCATG 3120

ACTTTCGGAC CAAATTTTTC TACCAAACGC CCCATAAATG CAGCCGATAA GCCCAAACAA 3180

AAGATTGCTA GACTAAAGGC GAAGGCAACA GAAGCCTGAT CCCATCCCGT 3230



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5096 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CCTATGAAGA CTGTCCCAAC TGGGTGTCCT TCTAGGCTAT CTGGTCCTGC CACTCCAGTC 60 

AAACTAATTC CAAAATCAGA CTGGGTCTTG CTTCGTGCCT GCTCAGCCAT CTTCTGAGCT 120 

GTAAATTCAG ACACCACACC ATGTTCTTCC AAATTCTTGG CAGGAATATC CAACATCCTT 180

GATTTTTCCT CCAAGCTATA GGTCACAAAA CCACCCTTAA ATATACTTGA AACTCCAGAA 240 

AAATTCGCCA CGGTAGCTTG GAAAAGACCT GCCGTCAAAC TCTCTGCAGC CGCGATGGTT 300 

TTCCCTTGCC TTTTCAGTTC TTCTACCACA ATGCTGGCTA AACTAGTTTC TTCCCCATAA 360 

CCATAGCAAA AGTCTCGTAA AGAAATTCCT TCGAAAGTCT GGCAGTCCAA GATTTGATTT 420

TCCAACATAT CCAGCGCTTG ATTCGCCTCT TCTTGACTGC TAGCCTTTGT TGACAGACGT 480

AGAGTGACTT CTCCTGTCTT GCCATAAGGG GCCAAGGTAG GATCGATCTG ATTATCAATT 540 

AAATCAGCCA AAATCGTAAC CAACTGGCTC TCGCCAATCC CAAAGAAACG AAGAACTCGG 600 

GAATACAGCT TGCTCCCTGT CATCAACTTG GGTAGAAGTT GGTTTAAGAC CATGCGTTTC 660 

AATTCACTTG GCGGACCTGG AAGGACGACA TAGGTCACTC CGTCTACTTC TAATTTTCCT 720 

CCAACAGCCA GTCCTGTTTC GTTTGGCAGT GGAATCGCTC CTTCTACAAT TTGAGCTTGT 780 

CTTTCGTTAT TCGGTGTTCG GGCATAGTCT GGTCGCAGGG TAAAAAAGAT ATCCAACTTC 840 

TCCTGAGCCT GAGGATCAAA GACTAATGCT TTCCCTAAAA ATTTAGCTAG GGTTTGTTTG 900 

GTTAGGTCGT CCTCAGTTGG CCCCAAACCG CCTGTCAAAA TCACCAGACT GCTACGTTGA 960 

CTGGCAATCT CAAGCAAAGA CAAGAGACGA ACTTCATTGT CTCCTACAGC CGTCTGAAAA 1020 

TATACATCTA CCCCAATCTC AGCTAGTTTT TCCGACAAAA ACTGGGCATT GGTGTTGACA 1080 

ATCTGCCCTG TCAAAATCTC TGTTCCAACA GCAATGATTT CTGCTTTCAT GTTTCCTCCT 1140 

ACCTATCTAT TCGTATTTTT TTGAAAAAAT CGCAGGAATT TTCCTACGAT TGATTTTTTT 1200 

ATTTGTATCA AAAGTTAATT ATCTTCATCA CCAACAGGTG CTCTGCCAAA TAAATCTTCA 1260 

AATAAAACCG CATTGGTTTC AAGCTGAGTA ACTTCTTCTT GTCCCAAAGA ACGTCGGAGT 1320 

AGATTTTGCA TTTCCAACAT ATGTGCTCTC GAAACAATCT GGTAAGAAAC ACCTTGAAGT 1380 

ATCTCTCCTT CACCCTGCAA CTGCTGAGTT TCAATGGTTT TAAATGAATC TTTATAGCCT 1440 

AGCAAGTTAG GGATACTTTT TGCAGACAAA TCAATATTGG TCTGCATATT GTCACTCAAA 1500

GCTTTTAGAA TCTCTTGATA ATGACCAATG CTATTTAAAC TGAGAGCTTT TTCCATGACT 1560 

TTTTGAATAA CTTCACGTTG ACGTTTTTGA CGACCATAAT CCCCCTCAGG ATCTTGGTAA 1620 

CGCATTCGTG CATAGACTAG GGCTTCTTCT CCCCCAATAT GTTGCTCCCC AACACCGATA 1680

GAAATAGTAT TAAATTCTTC TTGGTCACTG ATAGAAATTG GGAAACCTAG GATATTATTG 1740 

ACTGTAATAC CTCCTACTGC ATCCACTAGT TTTTGCAATC CTCTCATATT GACCATCACA 1800 

TAGCGATCAA TATGGATATT CATCATTTTT TGAATGGTTT CTATAGCAAG CTCTGCTCCA 1860

CCATCTGCAT ATGCTGAGTT CAGTTTCGCT TCATGAGCCT GACCATTCCC TGATTCAATG 1920 

CGCGTCAGAA TATCCCGCTC TAAACTCATC ATTGTTGTTT TTTTCGTTTT AGGATTCACT 1980 

GTCATCAAGA TCATGCTATC ACTTCTACCG ACCCAAGTTT CAGTTCGTTC AACATTTCCG 2040 

GTGTCCACTC CCATTAACAG AATGGTTAGA GGTTCAGTCG CTTCAATAAC CTTGGTTTCT 2100 

TCACCGATTT TTTTATACGT TTTAGCTAAG GTTTCTGTCC CTTGTTGATA AATAGTATAA 2160

GCAAAAACAC CTACTCCTAC TACAGTTACA GAAAGTAAAG CTAGCACCAT TCCAATAATT 2220

TTTTTAACCA TATTTCTACT AACCTATCAG TTTACCCATC AAGTAAACAT CGATAAATTT 2280

CCCTTCTTCT ATATATGCCC CACGCTCTTG GCTACCTTCA ATGACAAAGC CATGCTTTTG 2340

ATAAAGATCG ACTGCTGCTT GATTACGAGT TTGGACAGTC AGTTGGAGAC GACGCAGAAT 2400

GCCACTTGCT TGTGCCCACT CTATCGCTTC TTCTAGCAAC AAACTTCCCA AGCCATTATT 2460

CCAATATCTT TTTCCAATCA CAATGAAGAG ATCTCCAATA TGACGGACTC TCTTACGCTG 2520

ATCAGCTGTA ATATTTACAA TACCAGCAAT TTTGCCATTT AAGAATGCAA GTAAGGTTAT 2580

CTGATTGTCC GAACTAGCTT GCTTGTTGAG CAATATTTCC ATCTCCTCAC TAGTCAAGAG 2640

AATACCATCT CCGTCTAGGC TGGTAAAGTC TGTCTCCAAA CTCACACGAT TTAAAAAGGC 2700

CACTAATTCA GCTGCATCTT TGGGCTCTGC TTCCCTAATG AGCAATTCAT ACTCCATATT 2760

GAAGCTCCTC TAACAATTTC TCAGCACGCA AACCCTTTGC CTGAAAATTT AAACGGCGTC 2820

CATCTGCTTC TTTTAGAATT TCCAATTCTA AATAAGCATC TGGCAAGGCA TCTCCTAAGA 2880

GATTTCCCCA CTCAATAACA GTCACGCCGC CACCAAAGAT AAACTCATCC AAGTCGATAG 2940

AATCAGCATC TCCTTCAATA CGATAAACAT CTAGGTGATA AAGTGGAAGT CGACCTTCAT 3000

ACTCTCTCAC GATAGTATAG GTGGGACTTT TAATCATTTG AGAAATCTGT AATCCTTTTC 3060

CAAGTCCTTT AGTAAAGGTC GTTTTACCTG CACCCAGTTC TCCAGTTAAG ATTAAAACAT 3120

CATTCTTTGC TAATAGATGG CCCAAACGCT CCCCTAAGGC TTGCAACTCT TCTTCATTTT 3180

TTGTGTACAT ACTCTTATTA TACCAAAAAC TTTTCTTTTG TGTCTATTTT CCTACTAAAC 3240

TTATCATCAT AACATCCATA AAAAACAGGC TTTCTCTAAA AGAAAATGAG CGTAACAATG 3300

ACCAATACAA GATCTCGGAA AATATGACCA TAAAAGGAAA CTTCCTTCTT AACCGAATTT 3360

GGGACAAGAT AGGCTGCAAA AAACAAGCCC AGTCCAATAT AAATCAGAAG TGAGACAATG 3420

GTCATTGGAT TTCTTAAGAA AAGAAGTGTT GCTAAAATAG TCACCAACAC TGTcrrrrrr 3480

CTGTCCAGCA TAGCAAGAAA ATCGCGCACG TAttttttCA AGGGTAAAAA AATCAGCAAA 3540

TCTAGCCCAA ATAGGAAAAA GAAGGATGGC AATAAAAAGT CAACTAATTC TTGCTGCAGC 3600

GTATTTTTGA TGAACAAGTT ATCTGACAAA ACAAGAACAG CTCCTAACAA ATTAATTAAG 3660

AGTAACATAC TGTAAAAAAG CTTCACCGAC TTCTTACTGG CTAGGACACT ATGGACTTCT 3720

TGCTTACGGG TATAAAGATA ATTTACTCCA GCACAGATTC CTGAAACGAA AACCATGCTT 3780

CCGATGAAAA AAGCTGTACT TTGTTTAAAG GACAAGATGC ATTCCTTCCA TAGGAAACAG 3840

CTACTCAAAC TGATTTGAAT TAAAGCTAAC AAAAATAAGA TTCTCATTGA TTTCATCTTC 3900

TCTCTCCCTT CCTACCAATC ATTATACTAG GAGAAAAGAG AGAACTGTTT CTAATCTTCT 3960

CAAATGTCTC TTTAAGACGC TAAACAAACA CTAGAGACTA ATACTCAATG AAAATCAAAG 4020

ATCAAACTAG GTAGCTACCC ACAGGTTGCT CAAAACAGTG TTTTGAGATT GCAGATAGAG 4080 

CTGACGTGAT TTGAAGAGAT TTTCGAAGAA TATAAATTTG AAATCATGAA AATCCGTCAA 4140 

ACGGGTGGTT GTTTTGTCTC GCACCTCACG GAGCGAGACG GACTCAGAGT CACATAATTA 4200 

TAAGGCTGAT AGTATTAATC TAACTATCAG CtTmCAGGTT ATTTAACGTT TCAGAAAAAC 4260 

TATAATGTCA AGATTAACTA AACAGTATCT AGTTCCTTCA AATAATTTTC TATCTTCATC 4320 

AACATTAAAG GATTGTTATA AATCTTACAT AACTCTCTTG CTTCTATATA ATAATTTTTG 4380 

ACTTGTTCTC TGTCTAGAAA TTTGGCTCCA GCATTTCCTA CAAGAATAAG TAGAGGAGCC 4440 

AATTGGTAGC TTGTCTGTCT TTGTTTACAG AGTTCAATCG TTTCAAGAGC TTCTTGGATG 4500 

GCTTCATTAT ATTTTTCCTT TGATACTAGG TAGTGAGCGT ACTTGTAACG AACTCTGATG 4560 

TAGCCAAATA AAAACTCTTG ATGGTCCAAA TTTTTTGTCT GATACAACTC TATTAAATGA 4620 

GAGTAGTTTG CCTCATATTC TTGTTCACGA CCCACTAAGG AATAGAAATT AGATAGAGTA 4680 

TTCAACGCCT TTAAATAAAT CAGAGTATTT GAAGAGACTT TTAATAATAT ATTTTCCAAT 4740 

GACGAAATTG CCTCACACTT ACTGTCATAT TGATAGAAGT CAATTATAGA TTTAATCCAT 4800 

TCAAGGTAAG TTCGGTCTTC TAATGTTAGA AAAGTGCTTC GTTCTACTTC TATTTTATAA 4860 

AGATATTCTA AATCGTCATA A7TTCTGTCA TCTAATAGGC GAGCAGATAG ATGTTTGAAA 4920 

TTAGAGAGGT TAGACTTAAC TTCGATTTGT TCATTGAAAA AGTAATCCAA AGGGACTTCA 4980 

AGTCGTTGAG AGAGTTTGAA TAACAAGTCT GCGGAGGGAA TAAAATGACC TCTTTCAATT 5040 

TTACTAATCT GCCTTTGTTC ACAAATTCCT TCTGCAAGAG TTTGTTGGGA GAGTCT 5096 
(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2395 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

ACAAGATAAA AATAAAGGAT TACAATGGGG AATATAAAGT AAACCGGTAA ACCTAAAAAG 60 

AAAGGAGAAA AGATGAAAAT TGTACTTGTA GGGCATGGAC ATTTTGCTAC AGGGATTTAT 120 

AGTTCTTTAC AATTGATTGC AGGTAATCAA GAAAATGTGG AGGCGATTGA CTTTGTGGAA 180 

GGAATGTCAG CAGATGAACT CAAGCAAAAA ATCTTACTTG CAATTTCAAA TGAAGAAGAA 240

GTTTTAATCC TAAGTGATCT CTTGGGAGGA TCCCCATTCA AGGTTTCTTC TACCATAATG 300

GGAGAAAATC CAGCCAAGAC AATGAATGTT CTCTCGGGTT TGAACTTAGC CATGTTAATG 360

GAAGCAGTCT TTGCTAGAAT GGCTCATAGC TTTGATGAGG TTGTTAATAA ATCAGTAGTG 420

GCGGCCCAGG GCGGAGTCGT AAATGGTAAA GAATTGTTTT CAACGGATGC AGAGGAAGAG 480

GAAGAAGATT TCGAATCGGG TATTTAAAGG GTAAAAGAAT GATAAAAAAG GTTACGATTG 540

AAAAAATAAA ATCGCCTGAG CGCTTCTTAG AACTACCACT TCTGACGAAA GAAGAAGTCG 600

GCCAGGCAAT CGATAAGGTT ATTCGGCAGT TAGAACTCAA CCTTGACTAT TTCAAGGAAG 660

ATTTCCCGAC GCCAGCTACC TTTGATAATG TCTATCCAAT CATGGATAAC ACGGAATGGA 720

CCAATGGTTT CTGGACAGGA GAACTGTGGT TGGCTTATGA ATACAGTCAA CAGGATGCAT 780

TTAAAAACAT CGCTCATAAA aatwttcttt CTTTCCTGGA TCGTGTCAAT AAGAGAGTAG 840

AATTGGATCA CCATGATCTC GGCTTCTTGT ACACACCGTC TTGTATGGCT GAATATAAGA 900

TAAATGGAGA TGGAGAGGCT AGAGAAGCAA CCTTGAAAGC TGCAGATAAG TTGATTGAAC 960

GCTATCAAGA AAAAGGTGGT TTTATTCAAG CTTGGGGAGA CTTGGGCAAG AAAGAGCATT 1020

ACCGTTTGAT TATCGACTGC TTGCTCAATA TCCAACTCTT ATTCTTTGCT TATCAAGAAA 1080

CAGGCGATCA AAAATACTAC GATATTGCAG AAAGCCATTT CTATGCTTCA GCTAATAATG 1140

TAATCCGTGA TGACGCTTCG TCCTTCCACA CCTTCTATTT TGATCCTGAG ACAGGTCAAC 1200

CCTTTAAAGG TGTAACGAGA CAAGGGTATA GTGATGATTC ATGCTGGGCA CGTGGTCAAT 1260

CATGGGGACT CTATGGTATT CCTTTGACTT ATCGTCACTT AAAAGACGAG tCCTGCTTTG 1320

AcrivrrvAA GGGTGTGACC AATTATTTCT TGAATCGTCT GCCAAAAGAT CATGTGTCCT 1380

ATTGGGATTT GATTTTTAAT GATGGTAGTG ATCAATCACG AGATTCTTCA GCAACAGCTA 1440

TCGCCGTCTG TGGGATTCAT GAAATGCTAA AACATCTCCC AGAGGTGGAT GCTGACAAAG 1500

ATATTTATAA ACATGCTATG CATGCCATGC TTCGTTCCTT GATCGAACAT TATGCAAATG 1560

ATCAATTTAC CCCTGGTGGG ACAAGTCTCC TCCACGGTGT GTACTCATGG CATTCAGGTA 1620

AAGGAGTGGA TGAAGGCAAT ATCTGGGGTG ACTACTATTA CCTAGAAGCC CTTATCCGTT 1680

TCTACAAAGA CTGGAACCTA TATTGGTAGG AGGAGAAATA TGACAATGCC AAATATTATT 1740

ATGACCCGTA TCGATGAACG GTTGATTCAT GGACAAGGAC AACTTTGGGT AAAATACCTA 1800

GGTTGTAATA CGGTCATTGT TGCCAATGAC GAAGTAAGCA CGGACAAGAT GCAACAAACT 1860

CTGATGAAAA CAGTTGTGCC AGACTCAGTT GCCATGCGTT TCTTCCCTTT GCAAAAGGTG 1920

ATTGATATCA TTCACAAGGC TAATCCTGCT CAAACGATCT TTATCGTTGT AAAGGATCTG 1980

AAGGACGCTT TAACCTTGGT AGAAGGTGGT GTCACTATCA AAGAAATCAA TATTGGGAAC 2040

ATTCACAATG CCCCTGGTAA AGAGCAAGTG ACACGCTCCA TCTTCCTGGG TGAAGAGGAC 2100

AAGGCGGCCC TCAAGGAATT GAGCCAAACT CATCAAGTAA CATTTAATAC GAAAACAACT 2160 

CCAACAGGAA ATGATGGAGC TGTTCAAGTC AACATTATGG ACTATATTTA ACAGAGGAGA 2220 

TCGTTATGTC GATTAATGTA TTTCAAGCGA TTTTAATTGG ATTATGGACA GCTTTCTGTT 2280 

TTAGTGGAAT GCTGTTAGGA ATTTACACCA ATAGATGTAT TGTTCTGTCA TTTGGTGTCG 2340 

GAATTATTCT AGGTGATCTG TCATGCTCTT GCAATGGGAG CCAATGGTGA ATTGG 2395 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:

CCTTCTTTAG AGGTTAATTT TGCAAAATCG TCGATTGTTA TATAAGGATT ATTATAGAGA 60 

CTGTTCGCAA AGAATCTCTG ATATGTTTTT GAATCTTTTG AATACAAAAC TATCTCTCTA 120 

ATAGCATTGC CATCTGTTCC ATCAATTGGT AAACATACCG TAACTAGAAA AAGAATTATA 180 

TTCAAAATAA AAAATTCTGA TGCGTACGGC ACAAATCCCA AAAGTGCTAA TATTGCGACA 240 

ATTAGGTTAG CTCCACCTCC CCCAAAGAAG TAGAACACCA AATTCCTATC ACTATTTTTT 300 

TCATTAGTAA TGTTTCTATT ACTCATTTGA CAATAACCGA ATGCTAATAA CACTGGAAAT 360 

TTGAAATATA TTTTTTTTCT GAAATAGAAG AAAAAGGGAG TAGCAAGCAT CTCTAGTTTA 420 

TAAGATAAAC ATCTTCCCAC TAAAAAATGA CCTAGTTCAT GTAATGTAAT TGATATTAAC 480 

GAAATTAAAA TCAATCGAAA ATAATAGATT AATGAATCAT TTGGAAAAAT TATCAATAAT 540 

AGGAACAATA ACGGAATCAA ACATAAATAT ATGACAGAGT TATTTAATAT TTTCAACATA 600 

ATACCATTCC TCTAAACTAT TAGCTTCAAA AAGGCGTTTT TTCTCCCAAT ACATCTTCTC 660 

AAAATGTTCG GAATCATAAT TTTCTAAAAT TAATTTTAtG TCTGGTAAGC TCTTTCTTGA 720 

TAATCCGTTG TTTTGTACTT AATTTTCCCT TCAAGTACAT CTTCAATTTT ATAAGTTGCC 780 

TCCATCAACT GAGCCTCTGC AATATCTTTG AGTGAATTGG TAATTGAAAC TTGGTGTAAT 840 

ATCTGTCCts CCATATATGA AAATATATCT CTAAGATATT CTGACACATT ATCAGAGCCG 900 

TTACTCTCAG CAACATCTAA TGTTACAACA AACTTTCCAG CTAATCGAAA AAGATGGCTC 960 

CACCCCCCAA TCCTTTCAAT AAACTTTTTT GTGTCCACAG ATACGTTTTG TAAATATACA 1020

GGAGAAGAGA TAATTATAAT ATCAGACTCT AATAACTCTT TTTTTATAAC ACCTCCATCA 1080

TCAGCATTAC TTTGCCTATC AATTCCTTTC TTAAACAACT CTTCTGAATC AGAATTAGAT 1140 

ATTTCTAGCT CTGAATTGAA AGGTGTCCTG AAAGATATAT CAACATTATT TCTACTAGAA 1200 

ATGATACTTG AAAGTCTCTT AGTATACTCT AAAGTCTTAG AGTTATGATT TCCCACTCCT 1260 

GCATATATAA ATATTTTATT CATTTTAATT CATCCTCTCA ATTTGAATTT AGTAGATTTT 1320 

TCAAGATAGT ATGGTACAAA AACAGACTTT TGTTGACTCA CATTATTACA TATGTTTTGT 1380 

ATTAAACCAA AATCAATACT ATTTTTGGAG TAATTTTGAT TTTAGTTTAA AATCATTTCT 1440 

ATAACAGTAG CATATACCTC AAGCCGTTTA GCAATTAGAA TAGAACTTTT CTTTATTATA 1500 

TTATTATCTC AACGAAAAGC TACACTATTA AAAATATTTT ATAGAATTAC ATATTAAACT 1560 

AGTCAATCTT GGTATTTTTA TATTGCTTAA TGAGTGGACA CCTCTATTTT AGAAACAAAA 1620 

CTATAAATTA AGCTAGATTT CAAGTAATGA GGGGATAACT ATCTTTTTGT CATTCTGATT 1680 

CAGTGCGATA TACCTTAAAA AAGTATAAGC AATACCAGTC ACACCTGTAT ACAAAGAAAA 1740 

ATCTGGGAAA TTGCTTGTTT GGACGATACG ATACTCTCCT TCTTTTGATT TATTCATTAC 1800 

AACACTACAC AATAAAGACT CCAATTCCAT ACTAGTATCC ATTTCTTTCA TGTAGTCGAT 1860

GTAAAAATTT ATTATGGCCA TACTTCCATG GCAAAATGTA TCATTATCTA AACTAGCTAC 1920 

AATTCCCTCT GGAACACTTT GGGGATGATT AACTAATGTC CCAAATTCTC CACTACACCA 1980 

CTTCAAAGAA TGAATTTTGA TTTTCTCCCT AGGAACTAGT TGTAAAATTA ATTCTTTATA 2040 

TTTTTTAAGT CTTGTCACTT TATAAATATT TTTTAATGTA AAAATTACAC CTGATAGTCC 2100 

ATGGCCAAAA CTATATCCAA AATTACTATT ATCTCTCTCG CTTACATCAT TATATAGCGT 2160 

ATCACCTAAA CTTAATACTA GCCTTAGAAC ACGTTCCTTC TCTATTCCTC TCCTATAATA 2220 

TCTTACCAGT GTATTAATTA AAGGTAGAAG ACCATTAATA TAGTCAGACT TCTTTGAAAC 2280 

ACTTGCAAAA TCAGTCTTTT CAAGCTCAGT TAAAACACTC TTTATATAAT TTAAGCATGC 2340

GAGAGTATTT GTATCGTAAT CCTCTATAAT GGATAGAACA ATGAAATATC CTATATCCCC 2400 

AGTTAAACCA AATGTGGTCT TAGATAAAGA AACAGATGGC GGAATTGCAG ATAACATTTT 2460 

ATTGTACAGT TGAGTATATG ATGATTTATC TTTCAATAAT TTTACATAGT ACATAAACAG 2520 

TAATATTCCA GCTCPACCCC TATACATATC ATTmCCCGTT TGTTCAAGAC ACCATTTAGA 2580 

ACCTTTAAAA TTAACAGGTA TACTCCAAAT TGGATATTCG TCATAAATAT TATTAATAAC 2640 

CAAAGAGTCT GCAATATTTT CTACTTCATT ATGCAGAATA GTAACTAAAC TTTCATTTGG 2700 

GAGTTTTTTT CTATTAGATA AGTTTAATTT ATATCCTTTT TTTCGCTGAT CAAAGCTTGG 2760 

AAAATAAATT TCAATGATAT CAAGTTGCTT TTCTAAATTT TCCAAATTAT TATTAGGTAA 2820

ATATTTCATA AAATAGTCAT ATCCAGAAAA TTGATGTACG GAAATAAAAT GATTTCCAAA 2880

ATCATCGTAG ATTTCATTGA TATTTGTATC TGTATAAAAA ATCGGAATAT CTAATAACCT 2940 

CATTTGTTCA CATTCGCTTG CTACAATACC TTGATTAGAA AACTTATTGC TCCAGAGATT 3000 

TTCCAATGCT TTTTCTCTAT CTAACATTTC TTCATAAAAA TCAGGATGAT ATAAAAAAGA 3060 

TAGTACTGAA GCATAGCTAT TTGTGTCTCT AAAAAGTACC CTTGTCTTTA AACCATACAA 3120 

GTTTGCTTTT AATAGCATTT TAAATTCTTC TGTTTTATTT AACTCTTCAA ATATCAGATA 3180 

AAAATCCCTA AAACCTTTTT TGAAATCTTT TATATACTTA TCAAATTCTA TATCACCATC 3240 

CCGAACAGGC AGGTTTTTCC CACCTTCAAA ATCAATTTTC CCAATATCAA ACTTTACCTT 3300 

ATCAGTATTT AAATTAATTA AAACTTGACC AGGGATCCTC TA 3342 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3454 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



GAGAAAAGAA TGTTAAAGAA AAATGATATT GTAGAAGTTG AAATTGTTGA TTTGACCCAT 60

GAAGGGGCAG GAGTTGCCAA GGTAGATGGT TTGGTCTTTT TTGTAGAGAA TGCTTTACCG 120

AGTGAAAAAA TTCTCATGCG TGTCCTCAAG GTCAATAAAA AGATTGGCTT TGGAAAAGTT 180

GAAAAATACC TTGTCCAGTC ACCACACCGT AATCAAGATC TAGATTTGGC TTACCTGCGT 240

TCAGGAATCG CGGATTTAGG ACACCTTTCT TATCCAGAAC AGCTCAAGTT TAAAACCAAG 300

CAAGTCAAGG ACAGTCTCTA CAAGATTGCT GGAATTGCAG ATGTAGAAGT TGCTGAAACG 360

CTTGGTATGG AACATCCAGT CAAGTATCGC AATAAGGCGC AGGTGCCCGT TCGTCGAGTG 420

AATGGTGTCT TGGAAACAGG ATTTTTCCGT AAGAATTCGC ATAACCTCAT GCCCCTTGAA 480

GATTTCTTTA TCCAGGATCC TGTCATTGAC CAAGTCGTAG TAGCTCTTCG AGACCTGCTC 540

CGTCGTTTTG ATTTAAAACC TTATGACGAA AAGGAACAGT CTGGATTGAT TCGGAATCTT 600

GTGGTGCGTC GTGGTCACTA TTCAGGACAA ATCATGGTCG TTTTGGTGAC AACTCGTCCA 660

AAAGTTTTTC GTGTTGACCA ATTGATTGAA CAAGTTATCA AGCAGTTCCC AGAGATTGTG 720

TCTGTCATGC AAAATATCAA CGACCAGAAT ACCAATGCGA TTTTTGGTAA GGAGTGGCGC 780

ACTCTTTATG GTCAAGACTA TATTACGGAC CAGATGTTGG GAAATGACTT CCAAATCGCT 840

GGCCCAGCCT TTTACCAAGT CAATACTGAA ATGGCGGAGA AACTCTATCA AACAGCCATT 900

GACTTTGCAG AGTTAAAAAA AGATGATGTG ATTATTGATG CCTATTCTGG TATTGGAACC 960 

ATTGGTTTAT CAGTCGCCAA GCATGTCAAA GAAGTCTACG GTGTTGAACT GATTCCAGAA 1020 

GCAGTAGAGA ATAGCCAGAA GAATGCTTCT TTGAACAAGA TTACTAATGC CCACTATGTC 1080 

TGTGACACGG CTGAAAATGC CATGAAGAAA TGGCTCAAGG AAGGTATTCA ACCAACCGTT 1140 

ATCTTGGTTG ATCCTCCACG CAAGGGCTTG ACAGAAAGCT TTATCAAAGC AAGCGCCCAA 1200 

ACAGGAGCCG ATCGCATCGC CTATATCTCC TGCAATGTCG CAACCATGGC GCGTGATATT 1260 

AAACTATACC AAGAGTTGGG ATATGAATTG AAGAAAGTCC AGCCGGTGGA TCTATTTCCT 1320 

CAAACGCATC ACGTCGAGAC GGTAGCACTT TTGTCCAAAC TCGATGTCGA TAAGCACATA 1380 

AGTGTTGAAA TTGAGCTGGA TGAGATGGAT TTGACAAGTG CGGAGAGCAA AGCAACATAT 1440 

GCTCAAATCA AAGAATATGT TTGGAATAAA TTTGAATTAA AAGTTTCGAC ATTATATATT 1500 

GCACAGATAA AAAAGAAATG TGGAATAGAA TTACGAGAAC ATTACAACAA GTCTAAAAAG 1560 

GATAAACAAA TTATTCCACA GTGTACACCT GAAAAAGAAG AAGCCATCAT GGATGCTTTG 1620 

AGACACTTCA AAATGATTTA ATAGAAAAGA ATGACAGTAT ATGACTTTCT GCATTTATTA 1680 

CATTCCTACT TGGTATAGGA ACAGCTATTA TTCCTTTCTT GCAAGGTATC AATTAGAAAA 1740 

TAGGCTCAAT ATAAAGATTG ATAGGATCAT TTTTATATTT AAAGGAGCGT TGAAATGATT 1800 

GATAAAGGCA ACAAAAAATT TTAGGATAAA TTTGCTAAGT TGTATGCCTC TTTTATGAAA 1860 

AAAGATAAAG AGGTTTATGA TAAAGTTTGT GAATATCTTA GTCCTCATTT GAATAAAGAT 1920 

ATGGAGGTGC TTGAACTTGC TTGTTGGTTT CGTGTCATAA CAGTTATAGA GGCAAATAGT 1980 

TATGTAAATA TAAGGAGTTC AAGACTTCTA CCAAAGTTTA AAACTCAAAA AATAAATAGT 2040 

TGGTGTGCTG CTTACAATAT CCATTTTAAT AATGGATATT GTAAGCAGCA CCCCcAtGAA 2100 

TTTAAAGATT CTTTAAAGAG TCTTATTTTG TGATGAAAAT TTAATATGTA AATCTCAGAC 2160 

GATAGAAATT AAAAACTCTA TCGTCTTTTT TATACTCAAA ATTAGGAGGT AAAAATGGTA 2220 

AGGATAAGAG GTCCCACTTA AAACAATTTA TGGCAAAATA AGGACGGAAT AACACAACAA 2280 

ATTCTCTAAA ACAAATCACT AAATCAATGT AAGATTGAAT GAAATCAATA TTTATGCTAT 2340 

AATTAAATAA ATTTAATGAA GAAAAAAAGA GGGATATTAT GGCACTTAAC TATAAACCAT 2400 

TATGGATACA GTTAGCAAAA AAAGGACTAA AGAAAACAGA TGTAATAGCT ATGGCAGGAC 2460 

TTACAACAAA TGTTATGGCA CAAATGGGAA AGGATAAACC AATTACATTT AAGAATTTAG 2520 

AAAGAATATG TAAGGCTTTA TCTTGCACTC CTAATGATAT TATTAGTTTT GAAGATAATT 2580 

TTAGTGACGA GGAATAGAAA ATGACTTTAA GGACAGAAGA TCAAGTTAGC GATTATGCAA 2640 
GAGAAGTATA GGCTTTAATG AAGTTGAAGA AAACATCAAT CAAGGTACTG GTCAAATAAC 2700

TACTTTTAAT CAATTAGGCT TCAAGGGATA TTCAAATAAG CCAGATGGTT GGTATTTACC 2760 

TAAAAATATG AATGATGTAG CAATAATCCT TGAAACAAAA TCAGAAGAAA GAGATATTAG 2820 

CAAACAAATT TTTATTGATG AGTTAATGAA AAATATAGAC ATAATTTAAC TAAAAATAAA 2880 

AACTAGATCC TTTTTTGAAA AAATTATATT ATTAAATTTG TAACTGTATC TATTGACAAT 2940 

GATAATTATT ATCGATACAA TAGACTTGAA ATATGTTTAA GGAGTTTTTA TGAAAaCAAA 3000 

TTTTTTCTAA TmGCTATTTT AGCTATGTCT ATAGTTTTTA GCGCTTGTTC TTCTAATTCT 3060 

GTTAAAAATG AAGAAAATAC TTCTAAAGAC CATGCGCCTG ATAAAATAGT TTTAGATCAT 3120 

GCTTTCGGTC AAACTATATT AGATAAAAAA CCTGAAAGAG TTGCAACTAT TGCTTGGGGA 3180 

AATCATGATG TAGCATTAGC TTTAGGAATA GTTCCTGTTG GATTTTCAAA AGCAAATTAC 3240 

GGTGTAAGTG CTGATAAAGG AGTTTTACCA TGGACAGAAG AAAAAATCAA AGAACTAAAT 3300 

GGTAAAGCTA ACCTATTTGA CGATTTGGAT GGACTTAACT TTGAAGCAAT ATCAAATTCT 3360 

AAACCAGATG TTATCTTAGC AGGTTATTCT GGTATAACTA AAGAAGATTA TGACACTCTA 3420 

TCAAAAATTG CTCCTGTAGC AGCATACAAA TCTG 3454 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3752 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:

CGGGAGTATA CTTAATATAA TTATAGTCTA AAAATGACTA TCAGAAAAGA GGTAAATTTA 60 

GATGAATAAG AAAAAAATGA TTTTAACAAG TCTAGCCAGC GTCGCTATCT TAGGGGCTGG 120 

TTTTGTTACG TCTCAGCCTA CTTTTGTAAG AGCAGAAGAA TCTCCACAAG TTGTCGAAAA 180 

ATCTTCATTA GAGAAGAAAT ATGAGGAAGC AAAAGCAAAA GCTGATACTG CCAAGAAAGA 240 

TTACGAAACG GCTAAAAAGA AAGCAGAAGA CGCTCAGAAA AAGTATGAAG ATCATCAGAA 300 

GAGAACTGAG GAGAAAGCTC GAAAAGAAGC AGAAGCATCT CAAAAATTGA ATGATGTGGC 360 

GCTTGTTGTT CAAAATGCAT ATAAAGAGTA CCGAGAAGTT CAAAATCAAC GTAGTAAATA 420 

TAAATCTGAC GCTGAATATC AGAAAAAATT AACAGAGGTC GACTCTAAAA TAGAGAAGGC 480 

TAGGAAAGAG CAACAGGACT TGCAAAATAA ATTTAATGAA GTAAGAGCAG TTGTAGTTCC 540 
TGAACCAAAT GCGTTGGCTG AGACTAAGAA AAAAGCACAA GAAGCTAAAG CAGAAGAAAA 600

AGTAGCTAAG AGAAAATATG ATTATGCAAC TCTAAAGGTA GCACTAGCGA AGAAAGAAGT 660 

AGAGGCTAAG GAACTTGAAA TTGAAAAACT TCAATATGAA ATTTCTACTT TGGAACAAGA 720 

AGTTGCTACT GCTCAACATC AAGTAGATAA TTTGAAAAAA CTTCTTGCTG GTGCGGATCC 780 

TGATGATGGC ACAGAAGTTA TAGAAGCTAA ATTAAAAAAA G9AGAAGCTG AGCTAAACGC 840 

TAAACAAGCT GAGTTAGCAA AAAAACAAAC AGAACTTGAA AAACTTCTTG ACAGCCTTGA 900 

TCCTGAAGGT AAGACTCAGG ATGAATTAGA TAAAGAAGCA GAAGAAGCTG AGTTGGATAA 960 

AAAAGCTGAT GAACTTCAAA ATAAAGTTGC TGATTTAGAA AAAGAAATTA GTAACCTTGA 1020 

AATATTACTT GGAGGGGCTG ATCCTGAAGA TGATACTGCT GCTCTTCAAA ATAAATTAGC 1080 

TGCTAAAAAA GCTGAGTTAG CAAAAAAACA AACAGAACTT GAAAAACTTC TTGACAGCCT 1140 

TGATCCTGAA GGTAAGACTC AGGATGAATT AGATAAAGAA GCAGAAGAAG CTGAGTTGGA 1200 

TAAAAAAGCT GATGAACTTC AAAATAAAGT TGCTGATTTA GAAAAAGAAA TTAGTAACCT 1260 

TGAAATATTA CTTGGAGGGG CTGATTCTGA AGATGATACT GCTGCTCTTC AAAATAAATT 1320 

AGCTACTAAA AAAGCTGAAT TGGAAAAAAC TCAAAAAGAA TTAGATGCAG CTCTTAATGA 1380 

GTTAGGCCCT GATGGAGATG AAGAAGAAAC TCCAGCGCCG GCTCCTCAAC CAGAGCAACC 1440 

AGCTCCTGCA CCAAAACCAG AGCAACCAGC TCCAGCTCCA AAACCAGAGC AACCAGCTCC 1500 

TGCACCAAAA CCAGAGCAAC CAGCTCCAGC TCCAAAACCA GAGCAACCAG CTCCAGCTCC 1560 

AAAACCAGAG CAACCAGCTA AGCCGGAGAA ACCAGCTGAA GAGCCTACTC AACCAGAAAA 1620 

ACCAGCCACT CCAAAAACAG GCTGGAAACA AGAAAACGGT ATCTCGTATT TCTACAATAC 1680 

TGATGGTTCA ATGGCAATAG GTTGGCTCCA AAACAACGGT TCATGGTACT ACCTAAACGC 1740 

TAACGGCGCT ATGGCAACAG GTTGGGTGAA AGATGGAGAT ACCTGGTACT ATCTTGAAGC 1800 

ATCAGGTGCT ATGAAAGCAA GCCAATGGTT CAAAGTATCA GATAAATGGT ACTATGTCAA 1860 

CAGCAATGGC GCTATGGCGA CAGGCTGGCT CCAATACAAT GGCTCATGGT ACTACCTCAA 1920 

CGCTAATGGT GATATGGCGA CAGGATGGCT CCAATACAAC GGTTCATGGT ATTACCTCAA 1980 

CGCTAATGGT GATATGGCGA CAGGATGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2040 

CGCTAACGGT GCTATGGCTA CAGGTTGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2100 

CGCTAACGGT TCAATGGCAA CAGGTTGGGT GAAAGATGGA GATACCTGGT ACTATCTTGA 2160 

AGCATCAGGT GCTATGAAAG CAAGCCAATG GTTCAAAGTA TCAGATAAAT GGTACTATGT 2220 

CAATGGCTTA GGTGCCCTTG CAGTCAACAC AACTGTAGAT GGCTATAAAG TCAATGCCAA 2280 

TGGTGAATGG GTTTAAGCCG ATTAAATTAA ATCATGTTAA GAACATTTGA CATTTTAATT 2340 
TTGAAACAAA GATAAGGTTC GATTGAATAG ATTTATGTTC GTATTCTTTA GGTACCTATC 2400

TTATGATTTC AGGAAATGTC ATTAAAAAAA CGACTCATTT TCTCTAACCT GAAAAATAGA 2460 

TTAGAGAAAA TGGGTTGTTT TATCTATTAT AGTTATTTGA ATGAAGmTAA GAAGAAGGTA 2520 

TACTCACATC ATTCACATAA TCTGTATATT GACTATAAGT TTTAAAAAAC AATTTTTAAG 2580 

CTCTTCCTTG TCTTCTCTAA CCAAGCGTGT TATAATGAAT ACTGCTCAAG CGACCTTCAA 2640 

TCGTGAAGCA CACACCACCT TCAATCGTGA ATAAACGAAT AGATGGGAGA CTTACCATGA 2700 

GTGATAACTC TAAAACACGT GTTGTCGTGG GGATGAGTGG TGCTGTTGAT TCGTCGGTGA 2760 

CGGCTCTTTT GCTCAAGGAG CAGGGCTACG ATGTGATCGG TATCTTCATG AAGAACTGGG 2820 

ATGACACAGA TGAAAACCGC GTCTGTACGG CGACCGAAGA TTACAAGGAT GTGGTTGCGG 2880 

TGGCAGACCA GATTGGCATT CCCTACTACT CTGTCAATTT TGAAAAAGAG TACTGGGACC 2940 

GCGTTTTTGA GTATTTCCTA GCGGAATACC GTGCAGGGCG CACGCCAAAT CCGGACGTTA 3000 

TGTGCAACAA GGAAATCAAG TTCAAGGCCT TTTTGGACTA TGCCATAACC TTGGGGGCAG 3060 

ACTATGTAGC GACTGGGCAT TATGCTCGAG TGGCGCGTGA TGAGGATGGT ACCGTTCACA 3120 

TGCTTCGTGG CGTGGACAAT GGCAAGGATC AGACCTATTT CCTCAGCCAA CTTTCGCAAG 3180 

AACAACTTCA AAAAACCATG TTCCCACTAG GACATTTGGA AAAGCCTGAA GTACGCAGAC 3240 

TAGCAGAAGA AGCAGGCCTT TCGACTGCTA AGAAGAAAGA CTCGACAGGG ATTTGCTTTA 3300 

TCGGAGAAAA GAACTTTAAA AACTTTCTCA GCAACTACCT GCCAGCTCAG CCTGGTCGCA 3360 

TGATGACTGT GGATGGTCGC GATATGGGCG AGCATGCAGG TCTTATGTAC TATACAATCG 3420 

GTCAGCGTGG CGGACTCGGT ATCGGTGGGC AACACGGCGG TGACAATGCC CCTTGGTTCG 3480 

TTGTCGGAAA AGATCTAAGC AAGAATATTC TCTATGTAGG ACAAGGATTC TACCATGATT 3540 

CGCTCATGTC AACTAGCCTA GAAGCCAGTC AAGTCCACTT TACTCGTGAA ATGCCAGAAG 3600 

AGTTTACGCT AGAATGTACG GCTAAATTCC GTTACCGTCA GCCTGACTCT AAGGTGACCG 3660 

TTCATGTCAA AGGAGAAAAG ACAGAGGTCA TCTTTGCGGA ACCACAACGC GCGATTACAC 3720 

CAGGACAGGC AGTTGTCTTT TACGATGGCG GG 3752 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3580 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:



TATTTATATT TTTTTATCTC TGGCATACTT TGATACCTTT TTAGACTTAA AGTCTTTAAT 60

AGTGCCTTTC CACCTCTTTT TATCTATAAA GATTCTCCTA CATCATAATT CATTTTTTTA 120

TTTAAACCTT TCTGTCTTAG TTTGTCTTTA TCTTCTTCAT ACCATTTTAA GATTGTCACA 180

TAGTGGTTTT GATAGGTCTT ACCACTGCTT TCCATGTATC TGGATAGTTT ATTTATCATT 240

ATATCTGTGT GTGAGTTTAA TTTTTCTTTT AGATTTTTAT ATTCTTCTTT GCTTAACCTT 300

ACATTTTTGA ATTCTCCATA AAAAATGGGG GTGGACTTTT TATCTATCTC TCCCTCTCTC 360

TCTTTATCTA TCTCTATATC TTTCCATGTA ATTCCAATCT GGAGTACCTC TACTGTCTAT 420

CGGTAATTTA ATTTTGATAT CTGGCAATAC TGTGCTAGAT ATTTGATCTT TATATTCAGT 480

ATTTTTTAAA GCTTGCCTAA TAATTGAAGT TAAATAGAAT GCTACTTCTT TATTCAATTC 540

TTTATTTTPT AATTTTAAAC AATGAATTTT CATATCTAGG CTTGCTTTAT ATTTATGATA 600

AAAGACTGCT CCTAAAAATG AAACAGATAT AAAATTTTCA AAAACTCTAT AATTTTTATC 660

ATCTATATCT TCGTAGTAAC CTAAGATACC ATTGTCAATA TTTGTAGCAC TAATTCTAGG 720

AGTTTTTCCA TCGAGTAAAT ATCTTTTTGG AATAGATGAG CCTGTTGGTA CTTAACTCGA 780

TTTCCCCTTT TTTTCGGTAA TAAATATTTC TTTTTATTTT GTTGTCTGAT ATTTTTCCTA 840

CCTGTCCTTT GTAGGATGAG TATTTTCTAG ATTTTCyTGA ATAACTTTTT ACTTGAAGTT 900

TTAGCTTTTG AACTAGTCGT TGTACTTTCT TTTTGTTTAT TATCAGTCCT GATCTTTTTA 960

ATATTGCTGT TATTCTCTAT ATCCTATTTT TCATTCATGA TATTCTTTTA CTAATTTTAT 1020

CTTAAATTCT GTGCTGTATT TGCCATTAAA AAACTGACCT CCTTTAGTTA GrrTTTTGGC 1080

CTAACTTTTG AGGGTCAGTT CAAAATTTGC GACTTTTAAA TGAATTCCAA TATTCAATTA 1140

TTAAGAGTTA ACATGGTGCT TGCCAATAGG AATCATTAGA GGCGAATTGG AAATAGGGTC 1200

ACGTATAATT TTTGCTTCAA GATTAAAGAT ATCTTTAACT AGTTTATCAT TTAGTATATC 1260

TTCAGGCTTT CCCTCTGCAA CAAGTTTACC TTCTTTAATT GCAAATAGGT AATCAGCGTA 1320

TCTTGCTGTT AGATTTATAT CGTGCAAAAT CATGCAAATG GTTGTCTTAT ATTTTTGGTP 1380

TAGATCAGTC AAGAGGTCTA ATAGTTCTAT TTGATATGAG ATATCCAAGT AAGTAGTTGG 1440

CTCATCTAAA AGTAGGATAC TTGTATCTTG GGCTAGGGCT AGAGCTATCC ATACTCTTTG 1500

CCTTTGACCC CCAGAAAGTT CTTCAACTAG GTTATTTGCT AGATCTTCAA CATTGGCCTT 1560

AACCATTGAT CTGTTTATTA TTTCAAGGTC ATCTTTTCCA AGACTCTTAA AAGGCTTTCT 1620

GTAGGGGAAA CGACCACGGC TTACAAGATC AGCTACTGTT ATTGATTCAG GGATTATTGG 1680

AGATTGAGGT AATATAGCTA TGTGTTTTGC TAAATCTTTT TCTTTATAAG AATTAATTGA 1740

TTTATTATCA AGCAATACTT CTCCCTCTAA TCGCTTTATA AGTCGAGACA AGGTTTTAAT 1800

GAGTGTTGAT TTCCCACAAC CATTTGACCC AATAATAACT GATATTTTTT CTTCAGGTAT 1860 

TTTTATATTT ATATTTTCCA AGATTATTTT TTCATCATAA CCGCAGGTAA GATTATTTGA 1920 

CCACAGACCT TTCATTATAT ATTCCTCCTG TTCATTTTTA TTAGTAAGTA TATTAAGTAT 1980 

GGTGAACCTA ACAAGCCAGT TACAACACCT ACTGGATATC TAGCTGGTAA AATATTTTGA 2040 

GAGAATATGT CTGATAACAA AACTAGTAAA ATTCCAACCA ATCCAGCTAA TATTGGGCTT 2100 

CTTTTCTTGC CAATATTTAA GGCTATGGGA CCAGCTAAAA AAGATATACA AGCTATTGGT 2160 

CCTGTAATTG AAGTAGAAAA AGCAGTTAAA GATACAGCGC AAAAAATTAA AACAAGCCTT 2220 

GAAAGCTCGG GATTTGCTCC AAGTCCGATT GCTATTTCTT CACCAAGTTC AATAATTTCT 2280 

AGTCTTTTAT TAAAAAATAA AACTAATATA GTAGCAATAA TACTTACTAT TAGAACAAGA 2340 

GGTATGTCAT CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTGT 2400 

AATTCATATC TTGCTACTTT CAACAATAAA AATGAGGTGC CTGCTCTTGT GACAGCTTGA 2460 

AAACCAATAC CTAATATTAT CAGTCTTGCT GCTGAAAAAC CATCTTTTTT AGCTAGTAAA 2520 

AATAATATTA AAGATGATGT TAGTCCACAA GTTATTGAAA TAATTCCAGT AGTTAAACTA 2580 

TTTGTTTTTA ATACCAATAT GCAAAAGACC GCTGCAATAG ATGAAGAACT TGTGACACCG 2640 

ATTATATCAG GACTTGCAAG AGGATTTCTT AACATAGTTT GAAAGATAAA TCCTGCCAAT 2700 

GCAAAAGACC AGCCAGCTAT AATTCCTGCT AATAATTTTG GTAATCTAAT TTCCATAATC 2760 

GAAAAACTAG CTCCAGGAAC AGTTTCACTA TTTAAGACTT TAATCAAAGT TGAAAAAGAA 2820 

TAACTTTCAT CTCCGATAAG TAAAATGAAA AATGATAGAC TGATTATTAT TAATAAAAAT 2880 

AGTGAGGAAA ATAGTGTTAT TCTATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 2940 

ATTAGTTATT AACCCCTCTA TTTTTCATAG TTACATAAAT AAGTACTGGA CCCCCGATTA 3000 

TTGCAGTAAT TATCCCTACT TCAATTTCAC CTGGTTTACC TAACATACGG CCGATTATAT 3060 

CACATATAAG CAAGAGCTCT GCACCTATAA AAGATGAAGA AATGGTCATT GTGCGTATAT 3120 

CTTTGCTTAT AAATAAGCCA CAAAAGTGAG GAACTATAAG ACCTACGAAG CCAATAGGTC 3180 

CACCAATTGC AGTAATACTT GAACATAAAA GCACACTTGC AATTATTGCA AGTGATCTTA 3240 

TCCTATTAAC ATTAACTCCA AGACCAACAG CCATTTCATC ACCCATAGcT AAAGCGTTTA 3300 

AATCTGATGA AATAAATATA GCTATCAAGT GACCTAAAAT TATAAAAGGT AGTAGTGTAG 3360 

ATATAGAAGA TAATGTAGCT GCTCCAAGGC TACCTATTTG CCAAAATCTA AATTTGTCTA 3420 

AGACGTTATT ATTCGGTAAA ATTAAAAAAC TTACAAAACT GCTTAAAGCC ATACTAACAC 3480 
AAGTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCGTTA CACCAATCGC 3540

GTATACAAAA ATTGCACTTA CTAAGCCACC AATGATTGCG 3580 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11378 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

CCAAATTGCT CCACAATTAT TATGGAGTCG TCGTTTGGCA GATGGGCGTG ATATGTGTGC 60 

TCAAGAATCG TTGACAGGCA AGATATTGAC CCCCTATGAT ATGAATCGTA AGCAAATCGT 120 

CAATATTTTA ACCCGTCTTC ATCGCTCACG TCCGTTGATG ACACAATTGA GTCGTTTGGG 180 

CTATGCCATG GAAACACCTG TAGATTTACT ACAGTCTTGG CAGGAAACGG CTCCAGATGC 240 

TTTGCGTAAA AATCATTTTA TCAGTGAAGT GATGGCTGAT TTACGTCAGA CTATTCCAGG 300 

ATTTAGAGAG GACCATGCGA CCATTGTCCA TGGAGATGTA CGACATAGTA ATTGGATTGA 360 

GACAGATAGT GGCTTGATTT ATTTAGTAGA TTGGGATTCG GTTCGCTTGA CCGATCGCAT 420 

GTTTGATCTG GCCCATATGC TCTGCCATTA TATTTCAGAA CATCAGTGGA AGGAATGGTT 480 

GACCTACTAC GGTTACAAGT ACAATCAAAC GGTATTAAGT AAATTGTATT GGTATGGTCA 540 

ATTGTCTTAT TTGAGTCAGA TTTCCAAGTA TTATATGAAC CAAGATTTAG AAAATGTCAA 600 

TCGGGAGATT CATGGTTTGC GTCATTTCCG AGACAAGTAT GGAAAGAGAA GATGAGAGTT 660 

AGAAATCGTA AAGGGGCAAC AGAATTACTA GAGGCAAATC CCCAGTATGT GGTCCTCAAT 720 

CCCTTGGAAG CCAAGGCAAA ATGGCGCGAC TTGTTTGGCA ATGATAATCC CATTCATGTG 780 

GAAGTTGGAA GTGGAAAGGG TGCCTTTGTT TCAGGTATGG CCAAGCAAAA CCCTGACATC 840 

AACTATATCG GGATTGATAT TCAAAAGTCT GTTTTGAGCT ACGCTTTGGA CAAGGTGCTT 900 

GAAGTTGGAG TGCCTAACAT CAAGCTCTTG TGGGTAGATG GTTCTGACTT AACTGACTAC 960 

TTTGAAGACG GTGAGATTGA TCGCTTCTAT CTGAACTTTT CAGATCCATG GCCGAAAAAA 1020 

CGCCATGAAA AGCGTCGTTT GACCTACAAG ACCTTCTTGG ATACCTTCAA ACGTATCTTG 1080 

CCTGAAAATG GAGAAATTCA TTTCAAGACG GATAACCGTG GCTTGTTTGA GTACAGTTTA 1140 

GTGAGCTTTT CTCAATATGG CATGAAACTC AATGGTGTCT GGTTAGATTT GCATGCCAGT 1200 

GATTTTGAAG GCAATGTCAT GACAGAATAC GAGCAAAAAT TCTCAAACAA GGGGCAAGTT 1260 

ATCTACCGAG TTGAGGCAGA ATTTTAAGAG ATAACCTAAA ATTAGGCTGT ACAAGTGCTT 1320 
1 i VjV» 111 Al. A


TAAGTTGCCA 


AACGTGCTAT 


ACTGATAGTA 


AGAATATGAA 


AAGTGAGGCG 


1380 




TTCGCCTCTT 


GCTTATGAGG 


AGGTGGACGC 


AATCGCAACA 


ATCGTAGAAT 


1440 


1 AO iT. AUAuA 


AGTTGTAGAA 


CCTGTCATAG 


AAGCTCCTTT 


TGAACTCGTG 


GATATCGAGT 


1500 


ATGoAAAGAT 


TGGCAGTGAC 


ATGATTCTCA 


GTATTTTTCT 


AGATAAACCC 


GAAGAATTAC 


1560 


C*fl*fV* & li/V*Jk/"* 

t- 1 ITiAACwVv. 


ACGGCAGACT TGACAGAAAT TATCAGTCCT GTCCTAGACA CCATCAAGCC- 


1620 


AGATCCCTTC 


CCAGAACAAT 


ATTTCCTAGA 


AATTACCAGT 


CCAGGTTTGG 




1680 


A I. ft * Ik ft ft n 

GAAAACCAAG 


GATGCCGTCG 


CTGGAGCGGT 


TGGAAAATAC 


ATCCATGTCG 




1740 


ft ft ffUft^V ft Jfh 

AGCCATCGAT 


AAGCAAAAGG 


TCTTTGAAGG 


AACCTTGTTG 


GCCTTCGAAG 




1800 


^» ft ^^*f| ft f**^^^"* ft ft 

GACTATGGAA 


TATATGGACA 


AGACGCGTAA 


GAAAACCGTC 


CAAATTCCAT 


AWAU 111 AU I 


1860 


atcaaaagca 


CGTTTAGCAG 


TTAAATTATA 


GAAAAAGAAA 


GGATAGCTTT 

rh A r^\J^m AAA 


TCAfiflATTrA 


1920 


ft 1 ft § apptA ft ft n ft 

aaagtgaaga 


AAACATGAGT 


AAAGAAATGC 


TAGAGGCCTT 


CCGCATTTTfi 

«^ X^X* sm A A A m \J 


nAAnAArt2kr*A 


1980 


ft ^^f^^* ft ft PVft^ft ft ft 

agggaatcaa 


AAAAGAAGAT 


ATCATCGACG 


CAGTAGTAGA 

^* m ft- «ft^* A * 4 


GTCGCTTCCT 

XJ ft <>w\# ft ft V> W A 




2040 


>^ >S ft *m ft ^ftM Mil & 

gcagacgcta 


TGGTCAGTCA 


GACAGCGTAG 


V* ft- r\ A A • A 


CAACftAAAAA 


A\_AU4* luAL 1 


2100 


lftfH ft ft ANUfetilt A 

TTACAGTTTA 


TACTGTCCGT 


GAAGTTGTTG 


ATGAAGTATT 




1 L\AsAAA 1 L»A 


2160 


GCTTGAAAGA TGCTCTTGCC ATTAATTCAG CTTATGAACT TGGAGACAAA ATCAAGTTTG 2220

AAGAAGCACC AGCTGAGTTT GGTCGTGTAG CAGCCCAATC TGCCAAACAA ACCATCATGG 2280

AAAAAATGCG CAAgCAAACA CGTGCCATCA CTTACAATAC TTACAAAGAA CATGAGCAAG 2340

AAATCATGTC TGGTACAGTA GAACGCTTTG ACAACCGCTT TATCTATGTC AACCTTGGTA 2400

GCATCGAAGC CCAATTGTCA AAACAAGACC AAATTCCTGG AGAAGTTTTT GCTTCTCATG 2460

ATCGTATCGA AGTTTATGTT TACAAGGTTG AAGACAACCC TCGTGGTGTG AACGTCTTTG 2520

TTAGCCGTAG TCATCCAGAA ATGATCAAAC GTTTAATGGA GCAAGAAATT CCAGAAGTTT 2580

ATGATGGAAC TGTTGAAATC ATGAGCGTGG CTCGTGAAGC AGGTGACCGT ACGAAGGTTG 2640

CTGTTCGTAG CCACAATCCA AACGTGGATG CTATCGGTAC AATCGTTGGA CGTGGTGGTG 2700

CTAATATCAA GAAGATTACT AGCAAATTCC ACCCAGCTCG TTACGATGCT AAAAATGACC 2760

GCATGGTACC AATCGAAGAA AATATCGATG TTATCGAGTG GGTAGCAGAT CCAGCTGAAT 2820

TTATCTACAA TGCCATCGCT CCTGCTGAGG TTGACCAAGT TATCTTTGAT GAAAACGACA 2880

GCAAACGTGC CTTGGTGGTT GTTCCAGATA ACAAGCTTTC TCTTGCCATT GGTCGTCGTG 2940

GACAAAACGT GCGCTTGGCG GCTCACTTGA CTGGTTACCG TATCGATATC AAGTCTGCTA 3000

GCGAATTTGA AGCCATGGAA GACGCTGCTT CAGTAGAGTT GGAAGTAGAA AACGATACTG 3060

TAGAAGAATA AAAGCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 3120

AGTCTGTTGT GTCTAACGAA GTGATTGATA AGCGTGATTT CCTCCGCATT GTCAAGAACA 3180 

AGGAAGGACA AGTCTTTATT GATcCTACGG GCAAGGCCAA TGGCCGCGGC GCTTATATCA 3240 

AACTAGACAA TGCAGAAGCC CTAGAGGCGA AAAAGAAGAA GGTCTTTAAC CGCAGCTTTA 3300 

GCATGGAAGT GGAAGAAAGC TTTTATGACG AGTTGATCGC TTATGTGGAT CACAAAGTGA 3360 

AAAGAAGAGA GTTGCGACTT GAATAAGCAA AAGATAAGTA ATCTCTTGGG GCTTGCTCAG 3420 

CGAGCAGGGC GCATCATATC GGGTGAAGAA TTGGTGGTCA AGGCCATTCA AGACGGCAAG 3480 

GCCAAGTTGG TCTTTCTAGC TCATGATGCT GGACCCAATC TGACCAAGAA GATTCAAGAT 3540 

AAAAGTCATT ATTATCAAGT AGAAATTGTA ACCGTGTTTT CAACACTGGA ATTAAGCATA 3600 

GCAGTCGGGA AATCGAGAAA GGTTTTGGCT GTAACAGATG CTGGATTTAC AAAGAAAATG 3660 

AGGTCTCTTA TGGAATAGAA GAGGAGGACA TGATTTGTCT AAGAAAAGAT TGTACGAAAT 3720 

CGCAAAAGAA CTTGGAAAAG AAAGTAAAGA AGTTGTAGCG CGTGCAAAAG AGTTGGGCTT 3780 

GGATGTGAAA AGCCACTCAT CAAGTCTGGA AGAAGCTGTC GCTGCAAAAA TTGCTGCCAG 3840 

CTTTAAGCCT GCAGCTGCTC CGAAAGTAGA AGCAAAACCT GCAGCCCCAA AAGTAAGTGC 3900 

AGAAAAGAAA GCCGAAAAAT CTGAGCCAGC TAAACCAGCT GTAGCTAAGG AAGAGGCAAA 3960 

ACCTGCAGCC CCAAAAGCAA GTGCAGAAAA GAAAGCCGAA AAGTCTGAAC CAGTAAAACC 4020 

AGCTGTAGCC AAGGAAGAGG CAAAACCAGC TGAGCCAGTC ACTCCGAAAA CAGAAAAAGT 4080 

AGCGGCTAAA CCGCAAAGTC GTAATTTCAA GGCTGAGCGT GAAGCACGTG CTAAAGAGCA 4140 

GGCAGAGCGA CGCAAGCAAA ATAAGGGCAA TAACCGTGAC CAACAACAAA ACGGAAACCG 4200 

TCAGAAAAAC GACGGCCGTA ATGGTGGAAA ACAAGGTCAA AGCAACCGCG ACAATCGTCG 4260 

CTTTAATGAC CAAGCTAAGA AGCAGCAAGG TCAGCAAAAA CGTAGAAATG AGCGCCGTCA 4320 

GCAAGAGGAT AAACGTTCAA ATCAAGCGGC TCCACGTATT GACTTTAAAG CCCGTGCAGC 4380 

AGCCCTAAAA GCAGAGCAAA ATGCAGAGTA CGCTCGTTCA AGTGAGGAAC GCTTCAAGCA 4440 

GTATCAGGCT GCTAAAGAAG CCTTGGCTCA AGCTAACAAA CGCAAGGAAC CAGAGGAAAT 4500

CTTTGAAGAA GCGGCTAAGT TAGCTGAACA AGCACAGCAA GTTCAAGCAG TGGTTGAAGT 4560 

CGTCCCTGAG AAAAAAGAAC CTGCAGTGGA TACACGTCGT AAAAAACAAG CTCGACCAGA 4620 

CAAAAATCGT GACGATTATG ATCATGAAGA AGATGGTCCT AGAAAACAAC AAAAGAATCG 4680 

AAGTAGTCAA AATCAAGTGA GAAATCAAAA GAATAGTAAC TGGAATAACA ACAAAAAGAA 4740 

CAAAAAAGGC AATAACAAGA ACAACCGTAA TCAGACTCCA AAACCTGTTA CGGAGCGTAA 4800 

ATTCCATGAA TTGCCAACAG AATTTGAATA TACAGATGGT ATGACCGTTG CGGAAATCGC 4860 
AAAACGTATC AAACGTGAAC CAGCTGAAAT TGTTAAGAAA CTTTTCATGA TGGGTGTCAT 4920

GGCCACACAA AACCAATCCT TGGATGGGGA AACAATTGAA CTCCTCATGG TGGATTACGG 4980

TATCGAAGCC AAACAAAAGG TTGAAGTGGA TAATGCTGAC ATCGAACGTT TCTTTGTCGA 5040

AGATGGTTAT CTCAATGAAG ATGAATTGGT TCAGCGTCCA CCAGTTGTTA CTATCATGGG 5100

ACACGTTGAC CACGGTAAAA CAACCCTTTT GCATACTCTT CGTAACTCAC GTGTTGCGAC 5160

AGGTGAAGCA GGTGGTATTA CTCAGCATAT CGGTGCCTAC CAAATCGTGG AAAATGGTAA 5220

GAAGATTACC TTCCTTGATA CACCAGGACA CGCGGCCTTT ACATCAATGC GTGCGCGTGC 5280

TGCTTCTGTT ACCGATATTA            CGTAGCGGCA GATGACGGGG TTATGCCTCA 5340

GACTATTGAA GCCATCAACC ACTCAAAAGC AGCTAACGTT CCAATCATCG TAGCTATTAA 5400

CAAGATTGAT AAACCAGGTG CTAACCPAfSA ACGCGTTATC GGTGAATTGG CAGAGCATGG 5460

TGTGATGTCA            GTGGAGATTC TGAATTTGTT GAAATTTCGG CTAAATTCAA 5520

CCAAAATATC GAAGAATTCT            CCTTCTTGTG GCTGAAATCC AAGAACTCAA 5580

AGCAGACCCA ACAGTTCGTG            GGTTATCGAA GCGCGCTTGG ATAAAGGAAA 5640

AGGTGCGGTC            TTfiTAPAACA AGGTACCTTG AATGTTCAAG ACCCAATCGT 5700

TGTCGGAAAT                       TATGACCAAC GACCTTGGTC GTCGTGTTAA 5760

AGTTGCTGGA CCATCAACAf*            CACAGGTTTG AACGAAGCAC CGATGGCGGG 5820

TGACCACTTT GCCGTTTACfi ARCATRAAAA ATCTGCGCGT GCAGCAGGTG AAGAGCGTGC 5880

CAAACGTGCC CTCATGAAAC AACGTCAAGC TACCCAACGT GTTAGCCTTG AAAACCTCTT 5940

TGATACCCTT AAAGCTGGGG AACPCAAATf* TGTTAATGTT ATCATCAAGG CTGATGTACA 6000

AGGTTCTGTT GAAGCCCTTT CTGCCTCACT TCAAAAGATT GACGTGGAAG GTGTCAAAGT 6060

GACTATCGTC CACTCAGCGG TCGGTGCTAT CAACGAATCA GACGTGACCC TTGCCGAAGC 6120

TTCAAATGCC TTTATCGTTG GTTTCAACGT ACGCCCTACA CCACAAGCTC GTCAACAAGC 6180

AGAAGCTGAC GATGTGGAAA TCCGTCTTCA CAGCATTATC TACAAGGTTA TCGAAGAGAT 6240

GGAAGAAGCT ATGAAAGGGA TGCTTGATCC AGAATTTGAA GAAAAAGTTA TTGGTGAAGC 6300

GGTTATCCGT GAAACCTTCA AGGTGTCTAA AGTGGGAACT ATCGGTGGAT TTATGGTTAT 6360

CAACGGTAAG GTTGCCCGTG ACTCTAAAGT CCGTGTTATC CGTGATGGTG TCGTTATCTA 6420

TGATGGTGAA CTCGCAAGCT TGAAACACTA TAAAGACGAC GTGAAAGAAG TGACAAACGG 6480

TCGTGAAGGT GGATTGATGA TCGACGGCTA CAATGATATT AAGATGGATG ATGTGATTGA 6540

GGCGTATGTC ATGGAAGAAA TCAAGAGATA AGATTTTTTG CTCCTTTCTT AGGTGGTGAG 6600
GGACGCAAGC AAACCGATGG TTTCATTGCT TATTTTTGAG CCTAGGGTCT CAAAAATCCC 6660

CTGTGATGGG ACTGATAAAT CAGTTCCATC ACTTTCACCA CGGCGAAAGA AGCAGATGAC 6720

TTCAAATTGA ACTTCGTTTC AATTTAAACT GAAAATCAAG AACTTTAAAA TAGCTAGGTC 6780

TGCTGGCCTA GCTTTTGGTT CAAAGTAGAG AAAGGAATAT CATGGCAAAT CATTTCCGTA 6840

CAGATCGTGT GGGCATGGAA ATCAAGCGTG AAGTCAATGA GATTTTGCAA AAGAAAGTCC 6900

GTGATCCACG TGTCCAAGGT GTGACCATCA TAGATGTTCA GATGCTGGGT GACTTGTCTG 6960

TTGCCAAGGT TTATTACACC ATTTTGAGTA ACCTTGCTTC GGATAACCAA AAAGCCCAAA 7020

TCGGGCTTGA AAAAGCAACT GGTACCATCA AACGTGAACT TGGTCGCAAT TTGAAATTGT 7080

ACAAAATCCC AGATTTGACC TTCGTCAAAG ACGAGTCCAT CGAGTATGGA AAjCAAGATTG 7140

ACGAGATGCT ACGCAATCTG GATAAGAACT AAAGAAGAGG GGTTGCCCCT CTTTTTTGGT 7200

GGAGGAAAAT AGGTTGAATT TGAAATGGAA AAATATTCTT TTATAATAGA TTGAAACTAG 7260

AATAGTACGC CTCTACTTCT AAAATATTGT TAGAAATCGA TTTGACTGTC CTGATCGATT 7320

TGTCCTGTTC TTGTTTCATT TTAATATAAA AAAGGGATTC TGTATTTTTT AATGTTATCT 7380

AATTAGAAAA TGCTTTTTTT GTAGGAAATA TAATATGATA                       7440

GAGTTTGTAT ATGGCTGAAC AAGACTTAGC TATGCAAGTA TTGCAACAAG TGGTGAAACT 7500

ACCTGTTGTT AAGGTTGATC GTTCGAAATT TTTAGTGGAT AAGTTTTCCA AAGAATTGGA 7560

TCCAAAAGAT ATTCCTACCT TATTGGAACA AGGTCCAACG ACTCTTCTAT CTCAAGAAAT 7620

ATTAGATCGT GTAGCTAATG CTTGTATTCG GGACAATGTA TTATTAGCGA GTGGGACTTC 7680

TGTTTTGGCA GGATTACCTG GAGGGCTTGC TATGGCAATT ACCATTCCAG CTGATGTGGC 7740

TCAATTTTAT GCTTTCTCTC TGAAATTGGC TCAAGAATTA GGTTATATTT ATGGTTATGA 7800

GGATCTTTGG GCTTCACGAG AGGAGTTGAG TGAAGATGCT CAAAATACCC TCTTGCTTTA 7860

TCTAGGCGTA ATGTTAGGGG TGAATGGAAC CGCTGCTTTG CTACGTGTTG GTAGTATAAC 7920

AATTGCCAAA CAGGTAATGA AAATAGTGCC TAATAAAGCT TTAACAAAGA CGCTTTGGTA 7980

CCCTATTTTG AAAAAAGTCT TAAAAATATT TGGTGTGAAT CTTACCAAGG GAGGGTTGGC 8040

CAAAGGAATG GGGAAATTTA TTCCTATCTT GGGTGGTATC ATTTCAGGTG GTTTAACCTT 8100

TGCAACTATG AAACCAATGG GGGAAAGCTT GCAGAAAGAA TTATCCAAGC TAGTCAACTA 8160

TAGTGAAGTT CAATATCAAG AAGATGTTGA AACAATCCGA AAAGAGGCTG AAATCATCAA 8220

AGGAGAGTAA TATGAATCCT ATCAAAGCTT TTGCTAAAAT TTATGGTAAT TACTTTTTGA 8280

CCGTGCAAGG TGTAAAAGTG ATGAAAACGA TAAAGAAAGC TGACCATGTC GTTGTTGGTC 8340

TGGGGAAACT TTTTATTGCC GACAAGTTAA TGGATACGGC TCGGTGGCTC ATTAAGCCAG 8400
AGGAGAGAGA ATGAAATTTT TTTGGTCTTC TTGCTATTCT TTTTATCAAA CCGATTATTG 8460

GGATTGTGAA ATTCTTTTGG ATGATCATCT CTTTTGCAGT CCAATTGCTG TTTTACAAGA 8520

TAGTGTTTAA GATATTGGAT TGGCTCTTTA AACTTATCTA GATGGTAATC CAAGTTGCAG 8580

AGAACTAGCA GGAACTCCAC TGCTAGTTTT TTATTCTCTT TCCATATGGT ATAATATAAG 8640

CAGTAAAATC ATTTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAGCT TCACCTTGCA 8700

GTATATATGT TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACGGTG TTTTGAGCTG 8760

ACTTCGTCAG TTCTATCTAC AACCTCAAAA CACTGTTTTG AGCAACCTGC GGCTAGCTTC 8820

CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAACATAC AATGGAGGTC GTCATGGACA 8880

ATATCATCGA TGTGTCAATT CCTGTTGCAG AAGTGGTGGA CAAGCATCCA GAAGTCTTGG 8940

AAATTCTAGT GGAGTTGGGT TTTAAACCCC TTGCCAATCC CTTAATGCGC AATACAGTTG 9000

GTCGTAAAGT ATCACTTAAA CAGGGTTCTA AGCTAGCAGG AACTCCTATG GACAAGATTG 9060

TACGCACACT GGAAGCGAAT GGCTACGAAG TGATTGGATT AGACTAATGA CAGATGAACG 9120

GATTCATATC CTACGGGATA TTTTGTTAGA ATTGCACAAT GGCGCCTCTC CTGAGTCGGT 9180

TCAAGATCGC TTTGATGCGA CCTTTACGGG CGTGTCAGCC ATCGAGATTT CCCTTATGGA 9240

GCACGAGCTG ATGAACTCGG ATTCGGGCGT CACTTTTGAA GATGTTATGG AACTCTGTGA 9300

TGTCCATGCC AATCTTTTTA AAAATGCTAT CAAAGGTGTC GAAGTTTCAG ATACTGAGCA 9360

TCCAGGTCAC CCAGTTCGTG TCTTCAAAGA AGAAAATCTG GCTCTCCGTG CGCCCTTGAT 9420

TCGCATTCGT AGATTGTTAG ATACCTATGA GTCTATGGAA GACGAGGAAA TGCTGGCGGA 9480

GATGCGTAAG GGTTTGGTGC GTCAGATGGG ACTTGTGGGT CAATTTGACA TCCATTACCA 9540

ACGTAAGGAA GAACTCTTCT TTCCTATCAT GGAGCGCTAT GGACACGATT CACCTCCCAA 9600

AGTTATGTGG GGAGTGGATG ATCAGATTAG GGAACTCTTT CAAACAGCTC TAACGACAGC 9660

CAAGTCACTA CCAGAAGTGT CAATTAGCAG TGTAAAGGAA GCTTTTGAAG CTTTTGCGAC 9720

AGAGTTTGAA AGTATGATTT TCAAGGAAGA GTCCATCCTC CTCATGATTC TCCTTGAGTC 9780

TTTTACTCAG GATGACTGGC TTCAGATTGC GGAGGAGAGC GATGCCTATG GCTATGCCAT 9840

CATCCGTCCG TCAGAGAAAT GGGTGCCAGA ACGACAGAGC TTTATTGAGG AAAAGATTGC 9900

AGAGGAGCCT GTACAGCTAG ATACGGCAGA AGGTCAAGTT CAACAAGTCA TAGATACGCC 9960

AGAAGGCCAT TTTACCATTA CCTTTACCCC TAAGGAAAAG GAAGCTGTGC TGGACCGCCA 10020

TAGTCAACAG GCTTTTGGTA ATGGCTATCT TTCAGTCGAG CAGCCCAATC TCATCCTCAA 10080

TCATCTCCCT ATGGAGATTA CCTTTGTCAA TAAAGAAGAT ATTTTCCAGT ATTACAATGA 10140

CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACCCCG TCCCAAGTCG GGCGCAATGT 10200

CGAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATGA AGGGGCTTCG 10260 

TGAGGGAAGC AAAGACAAGT ATGAAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 10320 

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTGGAGTA 10380 

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTTCGTG GATTAGAATA 10440 

AGGAGAAAAA ATGACTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500 

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560 

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620 

CTTGCTTGGT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAAGG 10680 

CTTGTTTGGT GAGCGAAATT ATTAAACGAG AAAGATTCTT GATTTTTCAC TAAAATCTTG 10740 

ATAGAATGTT TATGTTAAAT CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800 

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 10860 

TTTTCAAGTA CGGTGACTAA GGTTGTAGAT GAGACAAAAA AAGTCGTGAA GCGTGCAGAA 10920 

CAGTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980 

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTGTAGAAG ATTTTGCAGC AAATTTGGGT 11040 

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAAGTC TGCTTCAGAT 11100 

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160 

AAGAAATTGA AAGAAAATGG CATCAAGTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220 

AAAAAATCTT GCTACAATAG AGCCATTAGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280 

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340 

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4156 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:
CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 60 

AACAAGAAAA GGTTCAAGTT GTGACCTTTA TCCCAAATGA AGGTCCTGAT GATCTATACG 120 

CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 180

ACCTTTGGAG TGGTTCTCCA TTTAACCAAG CTAGTCGCGT GATGGGAGAA AATCCTGAGC 240

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC 300 

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCGCTGC TAATATCATT AAAGAAGCCA 360 

AAGATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420 

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480 

TGAAAATCAA TCTTGCCCGT CTTGACACAC GTCTACTTCA CGGTCAGGTT GCAACTGCTT 540 

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC AGATAACGTG GCTAAAGACG 600 

ACCrrCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660 

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 720 

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 780 

CTCTTAATGT TGGTTCTATG GCTCACTCAA CAGGTAAAAC ATTGCTCAAT ACCGTTTTGT 840

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG TGACTTGGGT GTTGAATTTG 900 

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 960 

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020 

TAGTAGTCGT TGTAGCCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080 

ACCAACCACT TGTAGCCTGT ACCCTTATTG GGCTTGTAAC AGGTCACTTG GAAGCAGGGA 1140 

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGGTTG GTCAAATATC GGTGCTGCTA 1200 

TCGCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGGTG 1260 

ACTTTACCAA GACTGGTATC GGTGTTGCCC AAGCGGTTGC TATCCCTCTT GCTGTAGCTG 1320 

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 1380 

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440 

TCCAAGGACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500 

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 1560 

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATGGCAACT CGTGAAGTAT 1620 

GGCCATTCTT CGCTCTTGGT TTCGTTCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680 

TCGGTGCTAT CGGCGTTGCT ATCGCTCTTA TCTACCTTCA CCTTTCTAAA ACTGGTGGAA 1740 

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCCTA GAAGACTACT 1800 

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 1860 

TAAAAAAGTT TGGTGGCGTT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 1920 
AAACTTGGGC TGGGCTTATA CACTCATTCC AGCTATCAAA AAACTCTATA CTAAAAAAGA 1980

AGATCAAATC GCTGCTCTTG AGCGTCACCT TGAGTTCTTC AACACTCATC CATACGTAGC 2040

TGCTCCAGTC ATGGGGGTTA CTCTTGCGCT TGAAGAAGAA CGTGCTAACG GTGTGGAAAT 2100

CGATGACGCT GCTATCCAAG GGGTTAAAAT CGGTATGATG GGACCTCTTG CTCGTATCGG 2160

TGACCCAGTA TTCTGGTTTA CAGTACGCCC AATCCTTGGA TepercGGTG CTTCACTTGC 2220

CCTTACTGGC AATATCTTGG GGCCACTCCT cttctttgtt GCATGGAACT TGATTCGTAT 2280

GTCATTCTTG TGGTATGTTC AAGAGATTGG ATACAAGGCT GGATCAGAAA TCACTAAAGA 2340

TATGTCTGGT GGTATCCTTC AAGATATCAC TAAAGGAGCT TCTATCCTTG GGATGTTCAT 2400

TCTTGCTGTC CTTGTTCAAC GCTGGGTAAA TATTAAATTT GCTTTCGATG TTTCTAAAGT 2460

TCAACTAGAT GAAAAGGCTT ATATCCATTG GGATAAATTG CCAGAAGGGT CTAAAGGTAT 2520

CCAAGAAGCA TTCGCACAAG TAGGACAAGG ATTGTCTCAA ACTCCTGAAA AAGTTACTAC 2580

TTTCCAACAA AACTTGGATA TGTTGATTCC TGGATTATCA GGACTACTCC TTACTTTACT 2640

TTGCATGTAC TTACTTAAGA AAAAAGTATC TCCAATCACT ATTATCCTTG CCCTCTTCGC 2700

AGTGGGTATT GTGGCACATG TTCTTCACAT CATGTAATCA AGCAACTAAA AAGGAACCAG 2760

GTTCTAAAAT CTGATTCCTT TTTTCTATGC TTTTATTCAG CCAAGGCTCC CATTGGATCC 2820

CATGGTGCAA GTACGATTGG TTCTGCTCCA TAGGCAGCTT GTTCTTCTGC TGTCAGCAAT 2880

TCCTTACGAA CAACGATTTG GTATGTGTAT TCGTCCATCC AAGCGTCTGA GGCAACAAAG 2940

TAACCATCTG TACCGACCTT GTCTCCCCAT GAGTTTTCAA CCTTCCACTT GGTTGATTTA 3000

CCATTTTCGT CCAAGTCAAC ACCTGTCAAG ACCATGGCGT GGGTCATCAA GCTTTCACTA 3060

TAGTCCAAAC GTCCAGCCTT GTCTTGAGTA AGTTTAATGT CCATGCTTGA TTCAAAGTCA 3120

TAAACATCTG TCGCAAGGAT GCCAGCTTAC GGTTGCTGAG CTGGCCGACA TCAGAACCAA 3180

ACCAAACAGT CTCACCTGCT TGCATTTGGG CAATCGCCAA TTCTTTCAAG CGCTCCATTG 3240

GAACGTTGAT GTAGCGAACT GCACGGCTAC CAACCACATT CCCCAACATC TCAACTGTGT 3300

AAGATTTTCC GTAAGGTTTA TCAGCAGTTG GAGCATTGAT AACAGAAACG TAGTCTTCTA 3360

AAGGAAGATT GACATATTTC TTGTAAAACT CTTGTGGTGT GATTCCTTTT TCACTTTTGT 3420

AGTTGTTATC TTTATCGCGA TAAGCAAAGT CAAACTTGCG TGGTGGAAGT CCTAATGACA 3480

TAGCAAGAAA GTTAAAGATT TCTTGCAAGA GGTCTTCTTT CTTAGCTTGA ACAGTCGCTT 3540

GATCTGCACC AGAAACAAGC AAGTCACGCA AGATTTGAGC ATCTTGACGA AGCAATTTAT 3600

TAAGGATCGC ATTTAGCTCA CGACTGCTGC TAGATGAAAC AGACTCAGGA TAAACTGACT 3660

TAGGCACGAC ACCGTATTTT TCAAAGAGGG AAACGACCAT ATCCCATTGA CCGCCATCTT 3720

GTTGAGGTGT TTGGAGTAAG AAGCTAACtT CCGGCTAGTC AATTCTTCGT CTGAAGTCGC 3780

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGC 3840 

TTGTGACAAC TCAAAGTTCT CCAATTTGTA TTGCGAGATG AGTTTGTGGC GGAAGGTGTT 3900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CCCTTTCTGG TTAGTGACCT TGTCCTTGGT 3960 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGGCTT TGGCGACGTT CCAGAGCTGC 4020 

AAAAATTCCG TTGTGGCTGG CAGCATTTTC AATCGCTTGG TATTTTACAT TTGCTTCATA 4080 

CTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 4156 



(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3902 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:



AAAAACAACA AAATAAAACA AAAACAAAAA TATCGAGGTT TATTTTCAAA ACTTTCGATA 60

TTTTTATTAA GTTATTATTT TGTTGTTTCT AGTTTACTTT TTGATGGTTA ACAGTGGTGG 120

AGAATTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG CTAGCCGCAG GCTGTACTTG 180

AGTACGGCAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT TCGAAGAGTA TTAGTGCAAA 240

CCGTAGTTGT AGTCATCATC TTGCATGGCT TCAACTTCGC CAAGAAGGTA ACCATTTCCG 300

ACTTGAGAGA AGAAGTCATG GTTGGAAGTT CCTGTTGAAA TACCGTTCAT AACGATTGGG 360

TTGACATCTT CAGCTGAATC TGGGAAAAGT GGATCTTGTC CCATGTTCAT GAGAGCTTTA 420

TTGGCATTGT AGCGAAGGAA GGTTTTAACC TCTTCAGTCC AACCAACACC GTCATAAAGA 480

CTCTCTGTGT AGCCTTCTTC ATTTTCATAA AGAGTATAGA GTAGGTCGTA CATCCATTCT 540

TTGAGTTTTT CTTGCTCTTC TTCAGGTAAT TCATTGAAAC CAAGTTGGAA TTTGTAACCA 600

ATGTAGGTTC CGTGAACAGA CTCGTCACGA ATAATCAATT TAATGATTTC TGCAACGTTG 660

GCAAGTTTGT TGTTACCGAG ATAGTAGAGG GGAGTGAAGA AACCAGAGTA GAAGAGGAAG 720

GTTTCGAGGA AGACGCTGGC AACTTTCTTT TCAAGTGGGC TGCCGTTTAG GTAGATTTCG 780

TTGACAATCT CAGCCTTCTT TTGTAGGTAA GGATTGGTAT TGGTCCATTC GAAAATTTCT 840

TCAATCTCAG CCTTAGTATT CAAGGTAGAA AAGATTGATG AGTAAGATTT AGCGTGGACA 900
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GATTCCATAA ATTGGATGTT 


ATTGAAGACA 


GCTTCCTCAT 


GTGGTGTACG 


GATGTCTGCG 


960 


CGAAGGGCTT GAACCCCAGT 


TTCAGATTGC 


ATAGTGTCAA 


GAAGGGTTAA 


ACCACCAAAA 


1020 


ACTTTTCCGA CCAAGTCTTT 


CTCTTTGTTA 


GATAGCTTTC 


TCCAGTCATC 


CAAGTCGTTT 


1080 


GATAAGGGAA TACCTGTATC 


GAGCCAAAAT 


TGCTCCGTCA 


GTTTTTCCCA 


AGTTGATTTG 


1140 


TCGATGACAT CTTCGATGGC 


ATTCCAGTTA 


ATGGCTTTGT 


AGTAAGTTTC 


CATTTAAAAT 


1200 


CTCTTTCTGT GTTTAGTATT 


GCGAACTCAC 


AATTATTTCT 


ACTTTACCAT 


AATTCTATAG 


1260 


GACTATCGCA CAAAAAGTCG 


GAAGCCCGAC 


TTTTAAAATG 


TTACATAAAT 


TATGTTATGA 


1320 


CATAGTAGAT TTGATTTTAT 


CAGTGCTGCT 


TAGGGAAAAA 


TAGTGTTTCT 


ATGCTAGAAA 


1380 


CTAAATCACA CAGCTTTCAC 


ATTGGTTGGC 


GCCGACTTCT 


CCACCGTCAT 


CTGTAAAGGT 


1440 


ACGGACGTAG TAGATAGACT 


TGATTCCCTT 


GTTAAAGGCA 


TAGTTACGAA 


GGATGGACAA 


1500 


GTCACGTGTC GTTTGTTTAT 


TTTCCCTCTT 


CCATTCGTAA 


AGGCCTTTTG 


GAATGTCACT 


1560 


GCGCATGAAG AGGGTGAGTG 


AAAGTCCTTG 


ATCCACGTGT 


TCAGTCGCAG 


CAGCGTAAAC 


1620 


ATCGATGACT TTACGCATAT 


CCATATCGTA 


GGCAGAAGTG 


TAGTAAGGAA 


TGGTTTCTGT 


1680 


AGACAAGCCA GCAGCAGGGT 


AATAGATTTT 


ACCAATTTTC 


TTCTCTTGGC 


GTTCTTCGAT 


1740 


ACGTTGCGTA ATCGGGTGGA 


TAGAAGCAGA 


AACGTCGTTG 


ATATAGCTGA 


TAGAACCATT 


1800 


TGGCGCTACA GCAAGGCGAT 


TTTGGTGGTA 


AAGACCATCT 


TCTTGAACCT 


TGTCGCGAAG 


1860 


TTCAGCCCAA TCAGCAACAC 


CAGGGATAAA 


GACATTTTTG 


AAGAGTTCTT 


TAACACGGTC 


1920 


TGATGTTGGA ACAAATTCAC 


CAGTTACATA 


CTTGTCAAAG 


TAACTTCCGT 


TAGCATAGTC 


1980 


TGATTTTTCA AAGTTGTGGA 


AGGTAATACC 


ACGTTCACGT 


GCAATATTGT 


TTGACTCTAC 


2040 


CAAGGTCCAG TAGTTCATAA 


GCATAAAGTA 


GATGCTTGTA 


AATTCAACAG 


ACTCAGGTGA 


2100 


ACCATATTCA ATGAGTTGTT 


GGGCAAGGTA 


GCTGTGCAGT 


CCCATGGCAC 


CGAGACCAAA 


2160 


GGTGTGGGCT TGGCTATTTC 


CATGGTCAAT 


CGTTGGTACA 


GCTACGATAT 


GTGAACTATC 


2220 


TGTAACGAAA GTAAGGGCAC 


GAACCATAGC 


ACGGATAGAA 


CGACCAAAAT 


CAGGTGAAGT 


2280 


CATCATGTTA ACCACGTTGG 


TTGAACCCAG 


GTTACATGAA 


ACATCTGTTC 


CCATTTGAAG 


2340 


GAATTCTTGA GCATCGTTGA 


TCAAGCTTGG 


TTCTTGAACT 


TGAAGAATCT 


CAGAACACAA 


2400 


GTTACTCATG ATAATCTTTC 


CATCAACAGG 


ATTTGCACGG 


TTAGCCGTAT 


CGATGTTGAC 


2460 


TACATAAGGA TAGCCAGACT 


CTTGTTGCAA 


TTTAGAGATT 


TCAGTTTCCA 


AATCCCGCGC 


2520 


CTTGATTTTT GTCTTGCGAA 


TATTTGGATT 


TGCGACCAAT 


TCATCGTATT 


TTTCAGTAAT 


2580 


GTCGATGTAA TTGAATGGCA 


CACCGTATTC 


TTTTTCTACA 


GAGTAAGGGC 


TGAAGAGGTA 


2640 


CATTTCTTCA TTTTTACGAG 


CCAATTCGTA 


GAATTTATCA 


GGTACTACAA 


CACCAAGTGA 


2700 
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TAGACTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTGAAA GGAAAGCGAT 2760 

GATATCTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACGTT GCCCCAATTG 2820 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACGACAC CTGAAGCAGC 2880 

TCCTTCATAG CCTTTGATAG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 2940 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGAGTTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 3060 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 3120 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCGTCATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTCGTCT TTGATAAATT GAGCTAATTC 3300 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTCTT TCGATGTAGT TGTGTTCAAT 3360 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 3420 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 3480 

ACGGTTAATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 3540 

AAAATTTCCC TTATCTAATT ACAAAAGAAA CGCTTCTAAG TTAGCCCTAA AAGCAGTTTC 3600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAAACCTGAA 3660 

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AACCGAGCTC TTTAACTTGA 3720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTACTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 3840 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 3900 

GG 3902 
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 
TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60

CTGATAACCT AAAAAGGATA GTCAATTATG CTTGTTTACT AACTATTAAC TATGCTAAAT 120
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CAATTGAGGT 


TGTTTACATA 


AAACTCTATA 


TCAGAGAAGC 


CTCATATAGA 


GTTTTTTCTT 


180


GCTAGTTTTA 


GGATTTTTTT 


GTAAAATAGA 


AAAAGTGAAG 


AGAGGTATGA 


AATGAGCAAG 


240 


AAAGATAAAA 


AAATCGAAAT 


TCAAGTAGCG 


GATGCCAAAG 


TTAATGTTGC 


TAAAGACAGT 


300 


TTTGAAGGTT 


ATACATTGAC 


TATCGGTAAA 


AAAGTTATCG 


GAGAAATTGC 


CGAATTAGAC 


360 


GGACAATTTG 


CCATTATAAA 


GAATGGCAAT 


GTCGATAGTT 


TTTATAAAAA 


ATTGGAAAAA 


420 


GCTGTGGAAA 


TTTTGATTGA 


AAATTATAAT 


TTAGCAAAAT 


AAGTCTTGTT 


TTTTTGAAAT 


480 


TTTCATGATA 


TAATAGTCCA 


TGTTGATTGT 


AGGAGAGATA 


GCGAAGAGGC 


TAAACGCGGC 


540 


GGACTGTAAA 


TCCGCCCCTT 


CGGGTTCGGG 


GGTTCGAATC 


CCTCTCTCTC 


CATTTCATTA 


600 


ATGGGGTATA 


GCCAAGCGGT 


AAGGCAAGGG 


ACTTTGACTC 


CCTCATGCGT 


TGGTTCGAAT 


660 


CCAGCTACCC 


CAGTTCTTAG 


GTAATAATCA 


AGATAGAAAG 


CAAAATATCT 


TAGGGTATTT 


720 


TATTTTTATA 


ATTGAAAGAC 


GTGAATGATA 


TGAACATGTC 


CTTGCGGGTG 


CTTAGGAAAA 


780 


AAATTATAAG 


TATGTCAAGT 


TTAAGAAAAA 


CTTGATTGTT 


GGAGGATTTT 


TTAGATGAAC 


840 


GAATTTGAAG 


ATTTGCTAAA 


TAGCGTTAGT 


CAAGTTGAGA 


CTGGTGATGT 


TGTTAGTGCT 


900 


GAAGTATTGA 


CAGTTGATGC 


GACTCAAGCT 


AACGTTGCAA 


TCTCTGGAAC 


TGGTGTTGAA 


960 


GGTGTCTTGA 


CTCTTCGCGA 


ATTGACAAAC 


GATCGTGATG 


CAGATATCAA 


f VfZ h PTTTfyPT 


1020 


AAAGTAGGAG 


AAGTATTGGA 


TGTTCTTGTA 


CTTCGTCAAG 


TAGTTGGTAA 


AGATACTGAT 


1080 


ACAGTTACAT 


ACCTTGTATC 


TAAAAAACGC 


CTTGAAGCTC 


GCAAAGCATG 


GGACAAACTT 


1140 


GTTGGTCGCG 


AAGAAGAAGT 


TGTTACTGTT 


AAAGGAACGC 


GTGCCGTTAA 


AGGTGGACTT 


1200 


TCAGTAGAAT 


TTGAAGGTGT 


TCGTGGATTT 


ATCCCAGCTT 


CAATGTTGGA 


TACTCGTTTC 


1260 


GTACGTAACG 


CTGAGCGTTT 


TGTAGGTCAA 


GAATTTGATA 


CTAAAATCAA 


AGAAGTTAAC 


1320 


GCTAAAGAAA 


ACCGCTTCAT 


CCTTTCACGT 


CGTGAAGTTG 


TTGAAGCAGC 


TACTGCAGCA 


1380 


GCTCGCGCTG 


AAGTATTCGG 


TAAATTGGCT 


GTTGGTGATG 


TTGTAACTGG 


TAAAGTTGCT 


1440 


CGTATCACAA 


GCTTCGGCGC 


TTTCGTCGAC 


CTTGGTGGTG 


TTGACGGATT 


GGTTCACTTG 


1500 


ACTGAATTGT 


CACATGAACG 


TAATGTATCA 


CCAAAATCAG 


TTGTAACTGT 


TGGTGAAGAA 


1560 


ATTGAAGTGA 


AAATCCTTGA 


TCTTAACGAA 


GAAGAAGGAC 


GTGTATCACT 


TTCACTTAAA 


1620 


GCAACAGTAC 


CAGGACCATG 


GGATGGCGTT 


GAGCAAAAAT 


TGGCTAAAGG 


TGATGTAGTA 


1680 


GAAGGAACAG 


TTAAACGTTT 


GACTGACTTC 


GGTGCATTTG 


TTGAAGTATT 


GCCAGGTATC 


1740 


GATGGACTTG 


TTCACGTATC 


ACAAATTTCA 


CACAAACGGA 


TTGAAAATCC 


AAAAGAAGCT 


1800 


CTTAAAGTTG 


GTCAAGAAGT 


TCAAGTTAAA 


GTTCTTGAAG 


TTAACGCAGA 


TGCAGAACGC 


1860 


GTGTCACTTT 


CTATTAAAGC 


TCTTGAAGAA 


CGTCCAGCCC 


AAGAAGAAGG 


ACAAAAAGAA 


1920 
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GAAAAACGTG CTGCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 1980 

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 2040 

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 2100 

ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC GAGAAAGGAC AAATTTTGTC CTTTCTTTTT 2160 

TGATATTCAG AGCGATAAAA ATCCGTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAAGG 2220 

CATTGCGCTT GATAAGTTTG ATGAGATTAT TGGTCGCTTC CAGTTTGGCG TTAGAATAGT 2280 

GTAGTTGAAG GGTGTTGACA AGCTTTTCTT TATCTTTGAG GAAGGTTTTA AAGACAGTCT 2340 

GAAAAATAGG ATGAACCTGC TTAAGATTGT CCTCAATAAG TCCGAAAAAT TTCTCCGGTT 2400 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT GATAGAGCTG ATAGTGGTGT TTCAGG 2456 

(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10974 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

AAATAGGATA TAGAGACATC CTTCTGATCT GCTTTTwACA AAGTCCAATT ATATGCGGAT 60 

CTATACCTCC ACAATGTCCA TTATTATmCC TAACTATAAT ATGAGCCGAA AACACTATAT 120 

CCTTAATGTC TCCATATCCA TCAGGGATAT TAATATTTAT TTTTCCACAA CTATATTGCA 180 

TTGTAACCAT CTCCTTAAAC GACGCATTAT GATATTTGAT AGAGAAATTT TTATGAATAA 240 

CTCAATAATT TTATAGTAAA TCATGCTTAT ATCTCAAAGA TACCTATTTT ATCTTGTCTC 300 

GACCTTCTCC AAAGAATTGC TATAATACTA TTACAAATCC ATCTGCACTA CACTTCAAAT 360 

TTTAGCACTG TATAAAAACG TTTCAATACA CTAACTTCAA GAAAACTTCC ACTATTAATT 420 

GAAAAAATTG ATAGAGATAA ATTAAAAATC TATATTGAAA CTCATCCCGA TGCTTATTTG 480 

ACTGAAATAG CTGCTGAATT CAACTGTCCT CCAACAACTA TTCATTACGC TCTAAAGGCT 540 

ATGGGATATA GTCTAAAAAA GAGCCGTACC TACTGCGAAC AAGACCCAGA AAAAGTAAAT 600 

CGGTTCCTTA AAGAATTGAA TCACTTAAGC TACCTGACTC CTATTTATAT TTATGAGACA 660 

GGGGTTGAGA CCTATTTTTA TCTCGAATAT GATCGAGCCT TGAGCAGGCA GTTAGTCTCT 720 

CTGGAAGAAG ATATAATTAT TTGAATTAAG ATCGAGACAA CGCACACCAG AGATTGCGAT 780 

ACTGTTATAG AAGTACTAAT GCCCTTTTTT GTTTCAATAT ACTATGGCTC CGATGACCTA 840 
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TAAAGATACG ATGACGAGTG ACTTTTTCGA AGCTTGCTTC CAAAAATTCT TACTACCTAC 900 

TTTAGATACA CCATCCCTTA TCATTATGGA CAATGCAAGG TTTCACAGAA TGAACATGTG 960 

TAAGGAGCAG GGCATAGACT GTTACCACTT CCTACCTATT CACCCGAGTA TAATCCCATT 1020

GAGAAAATAT GGGCTTACAT CAAAAACATC TCAGAATAAT ATTGTCAAAT TACGATGCTT 1080

TTCTTGAGGC ACTTTTGTCC TATTCTTGTT TCAGCCGACT ATACTCCGTT ATTGGGCAGC 1140 

TACGGAACAG TCGATGGGAC GATGGGGGGA CATAAAAAAA TCCTCCACTT TTGTTTTTTA 1200 

TAACAGTATA CTGGAGAATT GACAATCTCG GTAGATACCT CGTTATAGCG CGGTTACTTA 1260 

TTAGGCAGTT ACAAAACAAC TGTGAACAGA AAACATTCCA GAGTCAGACA AGACTTTGGA 1320 

ATGTTTTGGC TCTATAATTT CTGTAGTGGG TAATCCCACC CCAGGAATTA TAGGGTCGTT 1380

TCTTGTAGAA AAAAAGCCCC ATATGACCTA TAATGAAAAG CGTCTAACCA ACTCATTAGA 1440 

AAGGGTTCAT ATGGAACAAC TTAAGAATAC CACAGATTTG CTCGGATTGG AAGACAAAAA 1500 

TATCAAAATC TTGTCTGTTC TGAAATACCA AACCCATCTA GTCGTTCAGG CAAAGTTGCA 1560 

TTCCCCCGCT CCTCCTTGTC CTCATTGTCA AGGGAAGATG ATCAAATACG ACTTCCAGAA 1620 

AGCCTCTAAA ATTCCGCTTC TCGACTGTCA GGGTTTACCC ACGGTACTGC ATCTCAAAAA 1680 

GCGCCGCTTT CAGTGCAAGA ATTGCCTTAA GGTGGTCCTT TCTCAAACAT CCATTGTCAA 1740 

GAAAAATTGC CAGATTTCCA ACATGGTGAG ACAAAAAATC GCTCAGCTCC TCCTTGAAAA 1800 

GCAGTCTATG ACTGAGATTG CCCACAGATT GGCGGTCTCA ACTTCCACCG TCATCCGAAA 1860

ACTGAGGGAA TTTAAGTTTG AAACCCATTG GACCAAGTTG CCAAAAGTTA TGAGTTGGGA 1920 

TGAGTATAGC TTCAAAAAGA GCAAAATGAG CTTCATTGCC CAAGATTTTG AGTCCAAATC 1980 

CATCCTCGCA ATTTTAGACG GGCGAACTCA TGCGGTGATT CGAAACCATT TCCAACCCTA 2040 

TCAGAGAGAG GTTCGGGAGC TGGTCGAGGT CATCACCATG GACATGTACA GCCCTTATTA 2100 

TCGGCTCGCT AAGCAACTCT TTCCAAAGGC GAAGATTGTT CTTGACCGCT TCCACATTGT 2160 

CCAACATCTG AGCCGAGCTA TGAACCGAGT ACGAATCCAA ATCATGAACC AATTTGACCG 2220 

AAAATCCTTG GAGTATCGGG CGCTCAAGCG CTTTTGGAAC CCTCGCTTTT TCGTTTCTAG 2280 

GCTCGGGCTA AATCAGTCCA CTGGACTGAT TTACTACACC AGTATAGCTT CAAGCTCTGT 2340 

CAGAAACGAT TCTATCAGCC CACGTTTCGA ATGCACTTAA CCCATCGGGA AGTACGAGAT 2400 

AAGCTGCTTT CTTACTCTCA GGGATTACAG GTTCACTACG AACTCTATCA ACTCCTGCTC 2460 

TTTCATTTTC AAGAGAAGAA TGCCGACCAT TTCTTTGGAT TGATTGAGCA AGAACTGCCA 2520 

ACGGTTCATC CGCTTTTTCA AACGGTCTTT TGGACTTTTT TAAGGGATAG AGATAAGATT 2580 

ATCAACGCAC TTAAGCTGCC TTATTCCAAC GCTAAACTTG AAGCGACCAA TAATTTGATT 2640 
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AAGATTATCA AGCGCAAAGC CTTTGGTTTC CGGAACTTTA ACAATTTTAA AAAACGGATT 2700 

TTGATGACTT TGAACATCAA AAAAGAGAGT ACCAATTTCG TACTCTCCAG ATTGCAGCTT 2760 

TTCGCCTACC CACTACACTT GACAAAGAGC CACTCTTTAT TCCATGGTAT CAAAGGCAAG 2820 

ACTTGGTTTG GCATTGAGGT CCCAGCCTGC GAAGTTTTCT TTGTTCCACT CGCTGACGCT 2880 

GGCATAGGCA ATCATACCTG CATTGTCTCC GCAGAGTCGC AGAGGGGGGA TGATAACCTT 2940 

GACATCTGTG ATTTCGGCTG CTAGGCGTTC TCTGAGACCT TTATTGGCTG CCACACCACC 3000 

TGCCACAACT AGGATTTTAA CAGGATATTT CTCCAAAGCC TTCTTGGTTT TTGCCATGAG 3060 

AATGTCCATA ACTGCTGCTT GGAAGGAAGC ACACAAATCT TCTGTAGACA GGCTTTCTCC 3120 

CTTTTGCTCG GCATTGTGAT GAAGATTGAT AAAGGCAGAT TTCAAACCTG AGAAGGAGAA 3180 

CTCCAGATTA TCTTCCTTAA TCATGGCACG GGGGAAATCA TAAATATCCT GCCCCTGATG 3240 

AGCCAGCTCG TCAATCTCAC GACCTGCAGG ATAGGTCAAG CCCATGACAC GGCCGACCTT 3300 

ATCATAAGCC TCACCAACCG CATCATCACG GGTTTCCCCA ACAATCTTAT AATCTCCTGC 3360 

CTCCGAAACA TAAACCAACT CTGTGTGTCC GCCGCTGACC AAGAGGGCTA GCAAGGGAAA 3420 

CTCCAAAGGC TCCACACTCT GAGCTGCCAT GAGGTGCCCA GCCATGTGAT TAACAGGAAT 3480

CAGTGGAAGT CCGTGAGCCC AAGCAAAGGC CTTGGCAGCT GACAAACCAA CTAGCAAGGC 3540 

TCCGACCAAG CCTGGTCCGT AGGTAACCGC AACAGCTGTC ACGTCCTCTT CGGTAATCCC 3600 

TGCTTCTGCC AATGCCTCCT CGATACAGGC TGTAATGACC TCGACATGGT GACGACTGGC 3660 

TACTTCGGGC ACTACGCCAC CAAAACGTTT GTGACTCTCA ATTTGACTAG CAATGACATT 3720 

GGACAAGAGC TCATCGTCGT TTTTCAAGAC GGCGACACTG GTCTCATCAC AGGATGTCTC 3780 

AAATGCTAAA ATATATCTAT CCTTCATCTA TTTCTCTCTT CATGATAATG GCGTCCTCGA 3840 

CTGGGTCATG GTAGTAGGCC TTTCGCTCAG CGATAACTGT CATCTTTTCT TTCTTGTAAA 3900 

ATGCTTGCGC TCGTTGATTT GACTGTCTGA CTTCGAGGAA AATTTCCTTG TCTGTCGGCA 3960 

ATTGAGCAAA CAAGGcTGAC GCAATCCCCT GACCCTGATA AGCTCCTTTG ACAGCGATTT 4020 

GCAGGACTTC TGCTTCAAAA AGATTCTCCT GCACAGCTAG AAATCCAATC ACTTCTCCCC 4080 

CATCATAAGC CAATGCATAC CAAGTCTGGT CTTGGGACAG ATCTGCTTGG ATTTGCTCCA 4140 

GAGTCCAAGG ACTGACTAGG TAAACAGCTG CCATAACAGC GTAGATGGCT TGAGCTAGGT 4200 

CAGGCTGTTG TTGAATTCGC TTGATTTCTA TCATAGGCGT TTAATGTAAG ACTCGCCAGA 4260 

CTCGGTATGG TTCTTGAGCC AGTTTTCCTC AGCCTCGACT CGTTTGAGCT AATTCGGCAC 4320 

AAAATCATGC AAGGAGTCTG CTTCCTTGTC CCAGGCCAAA AGAGCTAGAT TAGCTGCATT 4380 
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GGGCAATGTT TCTTTGTAAT CAGTCCTTGG CAACTGTTTT TGAATCTGCT CAACAAAGGG 4440 

GCCAACTTCT CCGACAAAGG TTACCTGACT AGTACCCTTG ACTTTTTCTA CCACCTCTTC 4500 

AAAAGATAGG TGCGCTTCTG CCATGACAGG TTTGGCATTT TCATAAAATC CTGCATAAAC 4560 

ATTATTGCGA CGCGCATCCA TCAAGGGGAC AAACAAACCT TCTTGTTGAT GGGGCACCAG 4620 

AGCCAAGAGA CTCGACATAC CAACCAACTC GATGTTCAGG CTGTGAGCTA AGGTCTTAGC 4680 

AGTTGCTACC GCAATTCGCA AGCCTGTATA GCTACCCGGC CCTTCAGCTA CCACGATTCG 4740 

GTCCAAATCC TTGGGTGTCC AATCCAAACT TGCCATCAAA AAATCGATGG CAGGCATAAG 4800 

AGTAATACTG TGATTTTTCT TAATATTAAT CGTCGTCTCG GCAAGAACCT GCTTATCCTC 4860 

TAAAATAGCC AGAGAAAGAG CCTTGCTGGA CGTATCAAAA GCTAATACTT TCATAACACA 4920 

TTCCTATCTT TTTGTCTGCT TACTATTATA CTACAAAAGC TGGCACATGG GAATTTTCTT 4980 

TGCCCCCAGA CAAGAGTGCC CTCACTTAAC TAAAAATAAT TTAAAAAAAT GCTCACTTTT 5040 

CCTTTTCTTT TCCGAATATA AAAGTGAACA AGAAAAAAGG AGGAAAGTTC AATGACAAAT 5100 

TTTGACATTC TTGACAATCA ATTTTTATCC TTATCTGAAA ATGAATTATC AGATATTGAT 5160 

GGCGGTCTCG CTCCCTTGGT TATCTTTGGA GTAGCAGTAT CTTGGAAGGC TATTGCAGGT 5220 

GGAACAGCAC TTATAGGTTC TGGTTTGGCA GCTCGTTATT TTTTAGGAGG AGATTAATAT 5280 

GATGAAAGAT TTGAACAATT ATCGTGAAAT TTCTAATAAG GAATTGCAAG AAATCAAGGG 5340 

TGGCTTTGGT GTCGGTGTTG GTATCGCTTT ATTTATGGCA GGTTATACCA TTGGAAAAGA 5400 

CCTTCGTAAA AAGTTTGGTA AGTCATGCTA GATAAGAAAC ACATTTTTAG AAGGATAAAT 5460 

TTTATTGTCT TCATCTCTTA CAGTTTGCTC AGCATTCTCA ATGATTTGAA CATTACTACC 5520 

ATCCCTTTAC CATTCGATTT ATCTGTTTGT ATTGTTTTAT TTTTATGCTT CAACTCTATT 5580 

TTTGATCAGA ACAATGACTC CCATAAAAAT AATAAGCTTT GAAAATTCCA TTGTCATGTC 5640 

ATGTTAGAAA AATGCAAAGA CCACCTCATC TTGATAGATG GGGTGGAATT TTCGTGTCGT 5700 

AAATCTACTA TCTCTACATT CCCAAACAAA AAACCCCAGC ATAAGCAGGG CATCTAAGCA 5760 

TTTAATTCAA AGTAAAATAC AAACCAAACG ACATAGGTCA CGAGGAGGAG AAAAAGCGAG 5820 

TAGAGAGTCA CAAAGGTCAT TTTCCACAAG AACTTGCTTT GTCGTCGTTC CAGTTTGGCA 5880 

AATAGAAGAT TCCCCGCATA AACGCAAGCA ACAAAAACAA TAAAAGCTAC CAAGCGAGCT 5940 

CCGATAGCAA AAGCAAATAA GTTATACATA GGGCAACCTC CTTGACTTAA AATCTATATG 6000 

GAATTATGAC AAGCAATAAA TTTCACTTCC GTTATCAACA TAATACATTT TCTTTATTTT 6060 

TGAAAACGCT TACCAAAGAA ATCGTCCCCT AACTTTCTCG TTTCCGTCTT TTACTAATTT 6120 

TTCATTTTGT GGTATAATTG AAATAATTGT AACGAATCAA GGTCAATCTA GACACAAAAT 6180 
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GGAATGAAAT CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG TAAATAGAAA 6240 

GGAACTATAT GATTTACAAA CTTTTTTATC AAGAAACAAA AGAACGTAGC CCACGCCGTG 6300 

AAACAACACG CACGCTTTAC CTACACATCG ATGCCAGCTC AGAACTTGAG GGCCGTATCA 6360 

CTGCTCGCCA ACTTGTCGAA GAAAATCGCC CAGAGTACAA TATCGAGTAT ATCGAACTCT 6420 

TGTCTGACAA ATTGCTCGAT TACGAAAAAG AAACTGGCGC CTTCGAAATT ACGGAGTTCT 6480 

AATATGGCCT ACACTCTTAA ACCTGAAGAA GTCGGCGTTT TTGCCATCGG TGGTCTAGGA 6540 

GAAATCGGGA AAAACACTTA CGGAATTGAA TACCAAGACG AGATTATCAT CGTCGATGCT 6600 

GGGATTAAAT TCCCAGAAGA TGACTTGCTT GGTATCGACT ATGTCATTCC TGACTACTCT 6660 

TACATCGTGG ACAATATCGA CCGCGTCAAG GCTGTTTTAA TCACACACGG ACACGAGGAC 6720 

CACATTGGTG GGATTCCGTT CCTACTCAAG CAAGCAAATG TCCCTATTTA TGCTGGACCG 6780 

CTTGCCTTGG CTTTGATCCG TCGGAAACTC GAAGAACACG GCCTCTTGCG CAACGCCAAA 6840 

CTTTACGAAA TCAACCACAA CACCGAGTTG ACCTTTAAAA ATCTCAAGGC AACTTTCTTT 6900 

AGAACGACTC ACTCTATTCC AGAGCCTTTG GGGATTGTCA TTCATACTCC TCAAGGGAAA 6960 

ATCGTCTGTA CGGGTGACTT TAAGTTCGAC TTTACTCCAG TTGGAGAACC TGCGGACTTG 7020 

CATCGTATGG CTGCGCTTGG TGAAGAAGGC GTGCTCTGTC TCCTGTCTGA CTCGACAAAT 7080 

GCGGAAGTAC CAACCTTTAC CAACTCTGAA AAAGTCGTTG GTCAGTCCAT TATGAAGATT 7140 

ATCCAAGGTA TTGAAGGACG TATCATCTTT GCATCCTTTG CCTCAAATAT CTTCCGTCTC 7200 

CAGCAGGCAA CAGAAGCTGC TGTTAAGACT GGACGCAAGA TTGCGGTCTT TGGTCGTTCT 7260 

ATGGAAAAGG CCATTGTCAA CGGAATCGAT CTTGGCTACA TCAAAGCTCC TAAGGGAACC 7320 

TTTATCGAGC CAAATGAAAT CAAAGATTAT CCTGCAGGAG AAGTTCTTAT CCTCTGTACA 7380 

GGTAGTCAGG GTGAGCCTAT GGCAGCCCTC TCTCGTATCG CCAACGGAAC CCACCGTCAA 7440 

GTACAATTAC AACCAGGTGA TACCGTTATC TTCTCTTCTA GTCCCATCCC TGGAAACACT 7500 

ACTAGTGTCA ACAAGCTGAT TAACATCATT TCTGAAGCTG GTGTCGAAGT TATCCACGGT 7560 

AAAGTGAACA ATATCCATAC ATCTGGACAC GGTGGTCAGC AAGAGCAAAA ACTCATGCTC 7620 

TGCTTGATTA AGCCAAAATA CTTCATGCCT GTCCACGGTG AATACCGCAT GCAAAAAGTC 7680 

CACGCTGGAC TAGCAGTGGA TACTGGTGTT GAGAAGGACA ATATCTTTAT CATGAGCAAT 7740 

GGCGATGTGC TTGCCCTTAC TGCTGACTCA GCTCGTATCG CAGGTCATTT CAACGCCCAA 7800 

GATATCTATG TCGATGGAAA TCGTATCGGT GAAATTGGCG CAGCTGTCCT CAAAGATCGT 7860 

CGCGATCTAT CTGAAGACGG TGTCGTTCTG GCAGTTGCAA CTGTTGACTT CAAATCGCAG 7920 
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ATGATTCTAT 


CTGGTCCAGA 


CATCCTCAGC 


CGAGGCTTTG 


TCTACATGAG 


AGAGTCTGGC 


7980 


GACTTGATTC 


GCCAAAGCCA 


GCGTATCCTC 


TTCAATGCCA 


TTCGTATCGC 


ACTGAAAAAT 


8040 


AAGGATGCTA 


GCGTGCAATC 


TGTCAATGGT 


GCCATTGTCA 


ACGCTATTCG 


CCCCTTCCTC 


8100 


TATGAAAATA 


CCGAACGTGA 


ACCGATCATC 


ATCCCGATGA 


TCCTCACACC 


AGATGAAGAA 


8160 


TAAAGCAAGA 


AAACAGCCCC 


GTCCTCGGAG 


CaXjTTTTTCT 


CTATGCTTTC 


TTTTGAGATT 


8220 


AAAACTCATA 


CTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA 


GCTAGCCGTA 


GGTTGCTCAA 


8280 


AGCACTGCTT 


TGAGGTTGTA 


GATAGAACTG 


ACGAAGTCAG 


TAGCCATACC 


TACGGCAAGG 


8340 


fTlACGTTGAC 


GCGGTTTGAA 


GAGATTTTCG 


AAGAGTATCA 


ATAAAAATCG 


AAATCAGACT 


8400 




GCGAAAGCAT 


AACTTGAGTT 


AGCTCCCATA 


GTTCGGGAAA 


CTATGGGAGG 


8460 






AGPTTTGAAC 


TCATTCGTAA 


GAAGCCGACG 


ACGTATCATT 


8520 


1 lunl i 1 i hj 


A Aft AfiTTTT A 


GAAATACTAC 


GATTTTTACC 


TTCCAGATAC 


ACCATCAAAA 


8580 


lAunnnln I \- 


l luuu ill 


ACTCCCGAAA 


TACGGCTGGC 


TTGGCCGATG 


GTTTCTGGAT 


8640 


luAlunul 1 1 






TTGCGATAGA 


ATCAATGTCA 


TCCCAGTCGA 


8700 




nn 1 v»W» nil 


Ivl 1^ V**» X V? w 


V AAA ^mf^% A W A A 


GGCAACCTGG 


TCCATGGCTT 


8760 




Irl.^ 1 1 V~/> 1 




TTTCAATCAA 


TTCGATAATC 


TTGTCATCCA 


8820 






ATGAAGGCCA 


CCACATCTTG 


GTAAGAAACT 


TCTGGACGGC 


8880




rf^*I*T*firtJ'*T( I'll' 1 


ACTGCATCGG 


TCAAGGGTTT 


GAAGCCCATC 


TCCTCAACCT 


8940 




1 1 L.<— 1 1\|A^ 1 


firtfT'PCIArtflT 


TGATACTGTC 


TAGGCGCTTC 


ATCTCATTAT 


9000 


LAnn X lun I 1 


1 1 1 \» 1 1 Vwt 1 1 


TCAAAACGAG 


CCCAGCGTTC 


ATCGTCCACA 


AGCCCAATCT 


9060 




f w T»f*AfiTr'AAfi 


CGCATATCAG 


CATTGTCATG 


ACGAAGAATG 


AGACGGTATT 


9120 




GGTCAAGAGA 


CGGTAGGGTT 


CAATGGTTCC 


CTTGGTCACC 


AAGTCGTCGA 


9180 




GATATAACCA 


TCACTGCGCT 


TCAAAATCAA 


TTCAGGCTTG 


CCTTGGATTT 


9240 


TCAGAGCCGC 


ATTGATACCC 


GCGATAATCC 


CTTGGCCTGC 


TGCCTCTTCG 


TAACCTGATG 


9300 


TTCCATTTGT 


CTGACCAGCA 


GTGAAGAGAC 


CTGAGATTTT 


CTTGGTTTCC 


AAAGTCGCAC 


9360 


GCAACTGATG 


AGGCAAGACC 


ATATCATACT 


CAATAGCATA 


ACCTGTCCGC 


ATCATCTCTG 


9420 


CATTTTCCAA 


ACCTTTGATG 


GAATGCACCA AGTCACGCTG 


GACATCCTCA 


GGCAGACTGG 


9480 


TTGAAAGTCC 


TTGCACATAG 


ACTTCCTCAG 


TATTGCGCCC 


TTCTGGCTCA AGGAAGAGTT 


9540 


GGTGACGTTC 


CTTGTCCGCA AAGCGCACAA TCTTGTCTTC AATCGACGGA CAGTAACGAG 


9600 


GCCCCACTCC 


CTTGACCACA 


CCTGTAAACA TAGGCGCACG 


GTGGAGGTTG 


TTTTGGATAA 


9660 


TCTCATGACT 


GGTACCATTG 


GTATAGGTCA ACCAGCATGG 


TACTTGGTCC 


TTGACATAAT 


9720 
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CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 9780 

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 9840 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AATAGAAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGAGTACTTG AGGTCTCCGA TGATAATTTC CCCACGGAGA GCAGTCCCTG 9960 

TCGTCACAAT AACAGCCTTA GCAGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 10020 

CCTTGCCATC TTCCACCAAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTGCGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCGCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 10200 

CAATGGTTTT GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTCG 10260 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CATTTCAATA TTGATGGTCG 10320 

CAAGCAGGAC CTTACAGCCC ATACGGCTAG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTCCTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC TAGCTGGATA TTCTGGAGCT TATCCAAGTC AATCCACTCA CAGGGCTGCC 10560 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10620 

ACTCGATATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAACC ACAAAAGCTA 10680 

GCTGCCCAGC TTGAGCTTTG ACACCCAGTT CTTCCTTCAC TTCACGGACT ACCGCGTCTT 10740

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAACCGTAT 10860 

TGTCTACTTT TGTCCGAAAG TCTTGTTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAACAGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 987 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 

ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
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ACAAGCCAAA TTTGTAAAGT TTACACTAAG CCCCTAGgCA ATCGTCTATC AGAATATCCG 180 

TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AAGTCTATAA AAACTATCAA 240 

CATTTATATG ACTTGCGAAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 300 

CAAGATAAAA ACATGTGTAA GCAAATCTGC TACACTTTAC TGGAGGACGC CAAGAATAAG 360 

AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 420

AAACTTATCC AAAAAGGGTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 480 

CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 540 

GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 600 

GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 660 

GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 720 

TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTGGTAT TCGACTTGGT CATTCCCCTT 780 

ACCGGTACAA CCATGAGCAA TTGTAGTCGC TCCTATCTGA TGCGCTATTT CAACCAATTT 840 

TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 900 

ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 960 

ATAAGATTCT ACTGCCCAAA CCTTAAG 987 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2651 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:

CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 60 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCAGT CAAATAAGCA 120 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 240 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 360 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 420 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 480 

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 540 
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CCTCTTTTTT ATTACTAAAA TTCAAAGAAT TCCAATGCTT TTTTCAAGAC CAAATCCGTA 600 

TATTCTGGAT CTTCTTGGGC TACTTCTATT TCCCGCTGAA CTTTTTCCAA ATCATCTGTA 660 

ATCACTCCAT CTACTCCTAA GTGAAGAGAT TTGCTGATAG CTTCTGAATC ATTGACAGTC 720 

CAGACATAAA GTTTCTGATC CGTTGTCCAT AGTTTGCTTA CAAAATATTC ATCCAAGGTT 780

GAGTACTCCA TAGTATATCC TGTCGCTCTT GTTTTAGGAA AGACAGAATT GTAGGGCATG 840

ATGAAATAAA CTGGTAGTTC GGCATCATAC TGTCTTACTT TTTCGACAAC ATGGTAGTCT 900

AAAGACTGGA TTTGATGTCC ATAAATCTTG AGCTTTGCAG CATAACGGGC TAAAAAGCGG 960 

TTCATCATGT CTGGACTATC TTTTTTACTG GTTTTAATTT CAATTAGTAA TTTTTGACCA 1020 

AGTTCGTTGG CTCGACTGAG ATAATCTTCA AAGCTTGAAA TTTTAGTCTG GTAGCCATTT 1080 

TCAAAAATAT CAATCCCTTT AAGCTCCTCC AAGTTTAAGT CTTGAGGACT TTTATTGATA 1140 

CCTGCTAGAT TTTTCAAGTT AGCATCATGC ATCATGACAA ACTGCCCATC TTTTGTTTCC 1200 

TCCACCTCCG TCTCCACCAA GTCTGGTTTG AGTTGTGCTG TAGTTTCCAA GGACTCTACT 1260 

GTATTTTGAA TCCCATTTGC ATTGGAAACC CCTCGGTGAG AAATAAGTTG AGGTAGATGA 1320 

ACCATGGGAG CCTCCAGATA AATATAACCT TCTAAGGCAA AGAAAAGACT GGCACAAGTC 1380 

ATGACACCCC ATCGCACGAT GTGATCTTTT TCTCTCCTAG GAAGCATATC CAGCTCCTTT 1440 

CCTGTCAAAA ATGAAACAAA TTTAACCAAA AAATAAGTCA GAGCCATATA ATAGAGATTT 1500 

TTAATCACGA CAAAATTCAA AATACCAAGA ATCAGAGACT CTCTCTGAGT GATATCATCT 1560 

ACCAAAGTTT GAGCCAATAA TAAAGGAATC AAAGGAAGAT ArAATAATAA ATGTGCTTTG 1620 

AGCAAGATGT AAAATAAATT CCAAGCATAA AAAGTAACTC TCTTCTTGGT TTTCTCCAAG 1680 

CTAAACATCA CTGCTTCTCG AACAGTCAGC TGATCATATA CAATCTTCGG AAGGGCAAAC 1740 

ATCAATCTGA CAGAGACATA GAGAAAGATA AGAGATAGAA GTAGGATGCT CAGCCACCAC 1800 

ATCCAATATC TATCTTCTAA ATAAGCTTGG ATAAACTCTG GAATGACGAT TTTATTAAGA 1860 

TAATAAATCT TCAGCATTTT CCGTATAAAA GGAAACAGCA TAGCTATATA GAAAAAGATA 1920 

AACAAGGCTT TAGCGCAAGT TAGCTTTTTC ATAAATCCAA AACTTTCATG GAAAACCTTG 1980 

CGGATATACT CAATTAGCCT TCGCTTTTCA TTATAGAGGA GATGACGAGC ACCAATAAAG 2040 

AGGAGTCCTA TTTGAAAATA AGCAACCAGA AGGTTAATTA CAATCAAGGC TAAAAAAGCT 2100 

AGACTAATCA ATGGAGAATG AGTAAGGATG GCTAAGACAT TGTTATAGGA AATAAAAAGA 2160 

TAACCTGTCT GATCTAATAA GAAGCTAGCC AACCATGAAT TGAATGGTAC CCACAAATAC 2220 

TCCACTATCA TAAAAATCAA GAAAAATAGA AAGAGGATTT TATCAAGATC GAGGTAAATC 2280 
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TGTTTAAGAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 2340 

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTGTC TCTAACAATT 2400 

CCCTAGCTTG ATCCGACTCT AAGAAGGATT CGTAAACACG CGCCGTCATC CGAGCATCCT 2460 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA GGCAACAGTT TGCAATTTGA 2520 

GATTGGCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTAAACCAT GCTCTGCTAA AATAGGTAAA TCACTTTTAG 2640 



CAGCATTGTA G 2651

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5638 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CGTTATAATA AACTTGTGAA AAAATTAACA AAGGATATCG TTCCTTGAAA GCTATGGAGG 60 

AAAATATCGC TGATAAAAAA ACTGTGACAC CAGAGGAAAA GAAACTCGTT GCTGAAAAAC 120 

ACGTAGATGA GTTGGTTCAA AAAGCTCTAG TTGCCCTTGA AGAAATGCGT AAATTGGATC 180 

AAGAACAAGT TGACTACATC GTTGCCAAAG CATCAGTAGC AGCTTTGGAT GCCCACGGAG 240 

AATTGGCTTT ACATGCCTTT GAAGAAACAG GACGTGGTGT ATTTGAAGAC AAAGCAACTA 300 

AGAACTTGTT TGCCTGTGAA CACGTAGTAA ACAACATGCG CCACACTAAG ACAGTTGGCG 360

TTATCGAAGA AGACGATGTA ACAGGATTGA CTCTTATTGC TGAACCAGTT GGTGTTGTTT 420 

GTGGTATTAC TCCAACAACA AACCCAACAT CAACAGCAAT CTTCAAATCA TTGATTTCAT 480 

TGAAGACACG TAACCCAATC GTCTTTGCCT TCCATCCATC AGCACAAGAA TCATCTGCTC 540 

ATGCAGCTCG TATCGTCCGC GATGCAGCTA TCGCAGCTGG TGCTCCTGAA AACTGTGTGC 600 

AATGGATTAC TCAACCATCT ATGGAAGCAA CAAGTGCCCT TATGAACCAC GAAGGTGTTG 660 

CGACAATCCT TGCAACAGGT GGTAATGCCA TGGTTAAGGC GGCTTATTCA TGTGCTAAAC 720 

CAGCTCTTGG GGTAGGTGCC GGAAACGTTC CAGCTTATGT TGAAAAATCA GCAAACATTC 780 

GTCAAGCAGC ACACGATATC GTCATGTCTA AATCATTTGA TAACGGTATG GTCTGTGCAT 840 

CTGAACAAGC AGTTATCATT GATAAAGAAA TTTACGATGA ATTTGTAGCA GAGTTCAAAT 900 

CTTACCACAC TTACTTTGTA AACAAAAAAG AAAAAGCTCT TCTTGAAGAG TTCTCCTTCG 960 

GCGTCAAAGC AAACAGCAAA AACTGTGCTG GTGCAAAATT GAACGCTGAC ATCGTTGGTA 1020 
AACCAGCAAC TTGGATTGCA GAACAAGCAG GATTTACAGT TCCAGAAGGA ACAAACATTC 1080

TTCCTGCAGA ATGTAAAGAA GTTGGCGAAA ATGAGCCATT GACTCGTGAA AAATTCTCAC 1140

CAGTTATTGC AGTTTTGAAA TCTGAAAGCC GTGAAGATGG TATTACTAAG GCTCGTCAAA 1200

TGGTTGAATT TAACGGTCTT GGACACTCAG CAGCTATCCA CACAGCTGAC GAAGAATTGA 1260

CTAAAGAATT TGGTAAAGCT GTTAAAGCTA TTCGTGTTAT CTGTAACTCA CCTTCTACTT 1320

TTGGTGGTAT CGGGGACGTT TACAATGCCT TCTTGCCATC ATTGACACTT GGATGTGGTT 1380

CTTACGCACG CAACTCAGTT GGGGATAACG TTAGTGCCAT TAACCTCTTG AATATCAAAA 1440

AAGTCGGAAG ACGGAGAAAT AACATGCAAT GGATGAAACT TCCTTCAAAA ACATACTTTG 1500

AACGTGATTC AATTCAATAC CTTCAAAAAT GTCGTGACGT TGAACGTGTC ATGATCGTTA 1560

CTGACCATGC CATGGTAGAG CTTGGTTTCC TTGATCGTAT CATCGAACAA CTGGACCTTC 1620

GTCGCAATAA GGTTGTTTAC CAAATCTTTG CGGATGTAGA ACCGGATCCA GATATCACAA 1680

CTGTAAACCG TGGTACTGAG ATTATGCGTG CCTTCAAACC AGATACCATC ATCGCACTCG 1740

GTGGTGGGTC TCCAATGGAT GCTGCCAAAG TAATGTGGCT CTTCTACGAG CAACCAGAAG 1800

TGGACTTCCG TGACCTTGTC CAAAAATTCA TGGATATCCG TAAACGTGCC TTCAAGTTCC 1860

CATTGCTTGG TAAGAAGACT AAATTCATCG CGATTCCAAC TACATCTGGT ACAGGATCTG 1920

AAGTAACACC ATTTGCCGTT ATCTCTGATA AAGCAAACAA CCGTAAATAC CCAATCGCTG 1980

ACTACTCATT GACACCAACT GTGGCAATCG TAGATCCTGC TTTGGTATTG ACAGTTCCAG 2040

GATTTGTTGC TGCTGATACT GGTATGGACG TATTGACTCA CGCGACAGAA GCATACGTAT 2100

CACAAATGGC TAGTGACTAC ACTGATGGTT TAGCACTTCA AGCCATTAAA TTGGTCTTTG 2160

AAAATCTCGA AAGCTCAGTT AAGAATGCAG ACTTCCACTC ACGTGAGAAA ATGCATAACG 2220

CTTCAACAAT CGCTGGTATG GCCTTTGCCA ATGCCTTCCT AGGTATTTCT CACTCAATGG 2280

CCCATAAGAT TGGTGCGCAA TTCCACACAA TCCACGGTCG TACAAATGCT ATCTTGCTTC 2340

CATACGTTAT CCGTTACAAC GGTACACGTC CAGCTAAGAC AGCAACATGG CCTAAGTACA 2400

ACTACTACCG TGCAGATGAA AAATACCAAG ATATCGCACG CATGCTTGGA CTTCCAGCTT 2460

CTACTCCAGA AGAAGGGGTT GAATCTTACG CAAAAGCTGT CTACGAACTC GGTGAACGTA 2520

TTGGGATCCA AATGAATTTT AGAGACCAAG GAATTGACGA AAAAGAATGG AAAGAACATT 2580

CTCGTAAATT AGCCTTCCTG GCTTATGAAG ACCAATGTTC ACCAGCTAAC CCACGTCTTC 2640

CAATGGTAGA CCATATGCAA GAAATCATCG AAGATGCATA CTATGGCTAC AAAGAAAGAC 2700

CAGGACGCCG TAAATAATTG TTTATCAGTC TAGAAGCAAG ACAAAAACTC AATTTGAGGG 2760

AAAGATCCAG TAATTTTTCT ATGATAAAAG GCATCCTATC AAGGTTTTTG AACACCTGAT 2820

AGGATGCCTT TTTATGATAT TGAGGCCTTT TTGCCCTTTT TGAAAAACTA GAATAGAAAC 2880

AAAATATATA ATAGATTGAA ACTAGAATAG TACATATCTG CTTCTAAAAC ATTGTTAGAA 2940

TTCGATTTGA CTGTCCTGAT CGATTTGTCC TGTTCTTATT TCATTTTGAT ATATAAAAAA 3000

TATAGTATAG TAGACTGAAT CTAAAATAGT ACGAAACAAT TGCTAAAACA TTTATAGAAA 3060

TTAATTTTAC TTTTCTGATA GAGTTGTTCA CATCTTATTT CAATTCACTA TAGTTTAATT 3120

TAAGAGTAGT ATTTACTAAG GCCCAATTAA AATCAAAGAG CAAACTAGAA AACGAjGTGCC 3180

ATTCAGCTCA AAACACTGAT TTGAGATTGC AGATAAGACT AGCCCCCTCA TTAACAGATT 3240

TACGATAAAA CGATGACAAG GTGTGTTGCT TTTTGATTTC TAAAGAGTAT AATGATAGAT 3300

CTCTATAAAA TAAGTGCGAA GGAAATGAGC TTTTATAGTC CTTTCGTTTT AAAATACTAT 3360

CTCAGATATT CTTATATCGA CAAGAAGTTT TTGAGTCATT CCCTCATCAT ACATATTAAA 3420

TAAATAGTGG CTCATTCAAT TTTTCACTAG AATAATAAGC TAGTATAGTA AACTGAAATA 3480

AGATATAAAC AAATAAATTG GAGCTTAACA TCCATTTCCA GCAATTTTTT AG AAA fTJlf* A 3540

GTGGACTATT CTAGATTCAA CATATTATAA AAACTAGAGT AAAAGAAAAG 3600

TGTGTAATGC AGGATCCAAT CCTTTCAATC ATTTTGTCCA ACTTTTGGAG GTTfVTAr'AA 3660

TGTAGTCGTC ATTAATAAAG ACAGATGGGA ATGACAGTGT TCCTATTTAT TTTY! AT An AG 3720

ATCGATGAAT TCTTTAGATA GCAACTGAAT AATCTCTGTT GAAGCCATTT GGT^TTOT'Gf* 3780

ATGCATAAAT AGCAAGGAGA ATCCTATTTT TTCTCCAGTA GCTTCTTTTT GTATGAGATT 3840

AGAGTGAATC TTGTCCGCTT CTACTAAGGA GTCTTCCGCT TCTTCAACTT TAATTTTCGf* 3900

TTCTTTTAAA TTTCCTGCCT TAGCTAGTTG GATGGCTTCA ATAAAGGATG ATTTfifiPTfiP 3960

TCCACTATTG GCAATGAGCT GAAAACAGAT ATATTCCATT TCTTCTGTCA TCTTATTTfT 4020

CCTATCCATG CAAGTGCTTG TTCCAGAACT TTTGCTCCAT TCATCATTCC GTAATCCCGC 4080

Alfilunnluu TATCTACAGG GATATTTCCT GCAATTTCTT TCACAGCAAG TAACTCATAA 4140

CGAATTTGTG GCCCAATTAG AATGACATCT GCTTCATGGA TATTCTTTTT AGCTTCTGTC 4200

ATTGATTTTG CTTGGATAGA GATTTCAATC CCACGTTCAG TCGCACTTTG TTGCATTTTT 4260

TTAACAAGCA TACTTGTCGA CATTCCCGCA TTACATACPA ATAAAATTTG TTTCATAATC 4320

TTAACCTTCC ATTTCTTGTT CAACAACTTT GTCATTAACT TTGATAAATG GAATGTATAG 4380

AAGAACTCCA AGTGCAAAGA TGATGAATTG AACTAGAACT GCTCTCACGT CCCCTGCTGT 4440

TGCTAACCAT GCATTTAAGA ATACTGGTGT AGTCCAAGGA ACTTGTATAA ATGCAGGACT 4500

CATGAATTCT GTAACTGTTG CTAAGTAGCT GATTAAAATA CCAAGGACTG GAACTGTGAT 4560

AAATGGAATA CCTAATGAAA TGTTATAAAC GATTGGGTAA CCGAATAATA CTGGTTCATT 4620

GATATTGAAG ATACCAGGTC CAAAAGATAA TTTAGCCACG TTTTTAGACA CAGCATTGCG 4680 

ACTCACTAAG AATGTTGCTA TTAATAAACA TAATGTAGAT CCACTACCAC CCATTAAAGC 4740 

GAATGTTTGT ATTTGTGATA GGTTGATGAT GTGTGGAATG GCTTGTCCAT TATTTGCTGC 4800

AGTGATGTTT TCAGTAATGT TAATTAATAG TAATGGTTCT AGGATGGCAC TGTAAATAAC 4860 

TGCTTGGTGA ATACCAAATA GCCATAACAT ATTTCCTAAA GAGTAAATAA TAATGACCCC 4920 

GATTAAGCTT GTACCAATAT GACGAATTGG TTCTTGAATA AAGATTGTAA TGATTGAGAT 4980 

TAAGTTCATT CCAGTTATAT TGAATAATAA TGCTGAAACA ACCCCAAATA AGGAGATGAC 5040 

GGTCATGACT GGAAGTAATA CGCTAAATGA TCTACTAACA GCTGGTGGAA TATTTTCACC 5100 

AAGGTTCATT TGTAAAGCTT TAACGTTTGA TAATTCAATG AATAATTCTG TTGCAATAAT 5160 

CGtACGATAA CCCCGGCGAA CATTGCGCCT GTACCTGTGT TGTTGAATGA AAGAACACCT 5220 

GAAATGTTTA CCGCATCTTT TGCTCCGTCA GGAACTACAG AAACTGTATT TGGCATCATC 5280 

ACAATTAAAG AAACTAATGA TAGCATTGAT GCTGCTAACG GGTTTTCGAA ATCTCTGTTT 5340 

TTAGCTAAGA AATAACCAAC CATTACAGCA ATAATCATAC CTGAAATACT TAAAGTACCG 5400 

TTTGCAATTG TTATTCCCCA ATATTGGAAT CTTGTTAATG TATCCCCTTG GAAAATCCAC 5460 

TTAAATACCG TGTTGTTCAA AAGAACGATT AAACCTGCCA AAATATATAA TGGCATTACT 5520 

GTTACGAATG CATCTCTTAG GGTTTTTAAA TGAATTTGGT TCCCTAGTTT ACCAGCAAAG 5580 

GATGGCAAAA AAATTTTTTT GGGGGGGGGG GTTATTAAAC CCCCCTTTTT AAAAAAAA 5638 
(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4745 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAATGAATT CCAACAATCT GTATCAGATA TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTGTAG ACCACCACCG TGTGGCTAAC TTTGAAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC GTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 300 
GCTGTGCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA TACCCTTCTT 360

TTGAAATCAC CAACAACACA CCCAACAGAT AAAATCATTG CTCCTGAATT GGCTGAATTG 420

GCTGGTGTGA ACTTGGAAGA ATATGCTTTG GCAATGTTGA AAGCTGGTAC CAACTTGGCT 480

AGCAAATCTG CTGAAGAATT GATTGATATC GATGCTAAGA CTTTTGAACT CAACGGAAAT 540

AATGTCCGTG TTCCCCAAGT GAACACAGTT GACATCGCTG AAGTTTTGGA ACGCCAAGCA 600

GAAATTGAAG CTGCAATGCA AGCTGCCAAC GAATCAAACG GCTACTCTGA AAA A W A A W 660

ATGATTACAG ATATCGTCAA CTCAAACTCA GAAATCTTGG CTCTTGGTGC CAATATGGAC 720

AAGGTCGAAG CGGCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 780

GTTTCACGTA AGAAACAAGT GGTACCTCAA TTGACTGAAA GCTTTAATGC CPAAfiATTTT 840

GGGTGTCAGC TCAAAATCGG AAAGTCTAGT TTGCCTTATA TCGCAAGGAG AAA V«xJ%J%^ A W>v 900

TTTTTTCTAG GAGTGAAGTA TGTTAGAAAA TGGCGATTTG ATTTTTGTGA GAGATGGGTC 960

AGACATGGGA CAGGCCATCC AGACTTCCAC AGGTAACTAT AGCCATGTTG C f ATTTATTT 1020

GGATGGGATG ATTTATCATG CTAGTGGACA GGCTGGTGTT GTCTGTCAAG AACCGGCAGA 1080

CTTCTTTGAG TCCAATCATT TATACGACCT CTATGTTTAC CCAGAAATGG ATATCCAGTC 1140

GGTGAAGGAA AGAGCTTGCA AACATCTTGG AGCACCCTAC AATGCTTCTT • A A A X* A A 1200

TGCAGCTGGT TTTTACTGCT CCCAGTATAT AGCAGAAATC CTACCTATTT X» A N» A J» A A A 1260

TCCTATGAAA TTTGGAGwTG GGGAGCAGGA GATTAGTGAT TTTTGGAGGG ART ATT Af* AT 1320

AGAACTAGGT CTGCCTGTTC CTCTGAACCA AGCTGGTACC AATCCTAGTC Av & A VJTVJx>a^wW 1380

ATCGCCTCTG TTACAATGTA AAGAAAGGAA TCTTCATGAT TCAGATTTTT A V\»rVl W A W 1440

GTTTGACGAG ACAGCCATTT TTGGAGAATT GATCCGCTAT CTGGATCAGT ATGAGGATGT 1500

GATTCTACGG GAAATTAAGG CTCAATTTCC AGATGTTGCA GTTGATAAAC TCATGGAAGA 1560

GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGCGC TATTACCTCA ATTTTCCTAC 1620

GCTTGAATCA CTTGATAGTC TTGAACTGGA TCAAGAGATT TTTGTCAGAG AAGCTAGTCC 1680

GGTCTATCAA GCCTTGTTGG AGCAGAGTTT TGAGACGGAA TTGCGCAATC AAATCAATGC 1740

AGCTATTTTA GTTGAAAAGA CGGACTTTGC GCGCATTAAA ATGACCCTGT CCAATTATTT 1800

TTACAAGGTC AAACAGCAGT ATCCTTTGAC AGAAAAACAG CAGGAGCTCT ATGACATTTT 1860

AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 1920

CAAAAAAGAC CAGCTTATGC AGAAATGCCG TGATATCTTT GTGGACAGTT AGGTTGTCTT 1980

AGGCTATATT GTGCAAAATG AAGATGGAAA GTATGAGTTG GCTATCGATT TTGATAAGGA 2040

GAGGTTAACT TTCTACTTAG CGTGATTTCT TGTTTCTGAG TACATTGTTT GACTTTCCTT 2100

AGTATTCGGT ATAAACTATA TGTAACCGGT AACACATATC GGAATAAACT AAAGGAGACA 2160

ATCATATGTC ACTTGAAAAC AAATTGGAAC AAGCAACAGG CGCTGTCAAA GAAGGTTTTG 2220

GTAAAGTTAC TGGAGACAGC AAGACAGAAC TTGAAGCAGC TGTTGAAAAA ACAGTTGCTA 2280

AGGCAAAAGA CGTTGTAGAA GACGCAAAAG GTGCTGTAGA AGGTGCCGTT GAAGGTTTGA 2340

AAAACGTTTT TACTAAAGAA TAGGAAAAAA TCAAGGGTTT CATTTTCCCT TGATTTTTTC 2400

TATTCTTATA AATAATTTTC TGCGACGGCT GTATCTCCTG GGTAGGATTC TTTCTTGCCC 2460

TGGATGATTT GGTAACAATC GGCTCCCTTA CCCGCAATAA TAACTGCATC TAATTCGTGA 2520

TTTGTGATAG CCATTGCCGC CTTGATGGCT TCTTGGCGAT CCGCAATCTT TTCAACAGGA 2580

TGATTGATCT AGCTACTAAT TTCATCTGCA ATGGCCATTG GGTCTTCATA GTTAGGGTCA 2640

TCAGCAGTCA GAAAGACTTG AATCTCAGGG TGTTGATTGA GGAGGAGGCC AAAGTCCTTA 2700

CGACGACTTT CTCCCTTGTT TCCTGTTGAT CCCAGAACCA GAGCAATCTT TCCGGTTTGA 2760

TGAGTTTCAA CCACATTGAT GAGTTTTTTC AGACTATCCC CATTGTGGGC ATAGTCGATG 2820

AAGACCTTGG CTCCATTTTT CTGAGTGAGG ACTTCCATAC GACCAGGAAC GCGGGTTGCA 2880

GCGATGCCTT TTTTGATGTC CTCAAGACTT GCTCCGAGAC GGAGACAAGC AAGTCCAGCA 2940

GCAACTGCAT TTTCTTGGTT GAAGTTGCCA ATGAGTTGAA TATCATAATC TCCAGCGAGT 3000

TTACCCGTAG CTGAAAAGCT AAAGGCTTTG GAATTCTCGA TTTGGTTATC AAATTGGCTA 3060

CCATAGAAAT CATGGTCTTG ATCTTCAACC TGTTCTTTCA AGACTGAGAA GTGGTCCATG 3120

TCACTGTTAA TGATGACTGC TCGGCTCTTT TCCATCAAGA GACGCTTGTG GTAGAAATAG 3180

TCTTCAAAGC TAGGGTGTTC AATCGGGCCG ATATGGTCTG GGCTGATATT TAGGAAAACT 3240

CCCACATCAA AGGTTAGACC ATAGACACGT TTGACCAGAT AGGCTTGACT GGAGACTTCC 3300

ATGATGAGGT GGGTACGGTC ATTTTGCACA GCCTGATTCA TCATGTCAAA GAGGTCAATA 3360

CTCTCAGGGG TTGTCAACGC TGACTTAAAG AAAGTCTCGC CATCAAGAGT TGTGTTCATG 3420

GTCGACAACA TAGCAGGTCT ATGCCCTTGA GATAAGATGT TATAGGCGAA ATAGGCTGCT 3480

GTTGTCTTAC CCTTAGTACC AGTAAAGGCA AGGAGTTTGA GTTTTTCCTG TGGATTACCA 3540

TAGAACTCCA TGGCAATCAA ACTCATGGCT TTCTTTATAT CGTTCACAAT GATGACAGGG 3600

ATACCGACTT CGTAGTCCTT TTCAGCTACA TACCAAGCTA ATCCTTGTGT TATAGCAGAA 3660

AGAAGGTATT crrrrTTAAA GGCAGCGCCT TTTGCGAAAA AAAGAGTGTC TTCTGTTACT 3720

TTTCGGCTGT CGTAGCTGAT GCTATCAAAA ATAACTTTGC TGTAGTTGTA GTGGTAATGA 3780

CCTTGGTCAA TAATTTCGCG AAAAAGGCCA TCTTTCTTTA AAATATCTAA TACGGTTTCA 3840

ATCTTAATCA TACTTTCTAT TGTAAACCGA AAGTCGTAAA TTTACAAGTA ACAAGGAAAA 3900

GTTTATAATG GAAGATAAGG AGTTTTTCCT AGTTATCAAA ATTGAATGAG GAATCTATGT 3960 

CGCACGAAAA CAATCACCAG CAGGCCCAGA TGTTACGGGG GACTGCTTGG CTAACGGCTA 4020 

GTAACTTTAT CAGTCGCCTA CTCGGGGCTG TTTACATTAT CCCTTGGTAC ATCTGGATGG 4080 

GGGCTTATGC AGCTAAGGCA AATGGTCTCT TTACCATGGG TTACAATATC TATGCTTGGT 4140 

TCTTGTTGGT TTCAACAGCG GCGATTCCAG TTGCGGTGGC CAAGCAAGTT GCCAAGTATA 4200 

ATACCATGCG AGAAGAAGAG CATAGCTTTG CCCTGATTCG GAGCTTCTTA GCCTTTATGA 4260 

CAGGACTAGG CCTGGTTTTT GCTTTAGTCT TGTATGTCTT TGCTCCTTGG CTAGCAGACT 4320

TGTCTGGCGT GGGCAAAGAC TTGATCCCAA TCATGCAAAG CTTGGCTTGG GGAGTCTTGA 4380

TTTTCCCGTC TATGAGTGTT ATCCGAGGAT TTTTCCAAGG GATGAATAAC CTCAAACCCT 4440 

ATGCCATGAG CCAAATTGCT GAGCAGGTCA TTCGTGTTAT CTGGATGCTC CTAGCAACCT 4500 

TTATCATTAT GAAGCTCGGT TCAGGAGATT ATCTAGCAGC CGTTACCCAA TCAACCTTTG 4560 

CTGCCTTTGT CGGTATGGTA GCCAGTTTTG CAGTCTTGAT TTATTTCCTT GCCCAAGAAG 4620

GTTCACTCAA AAGAATCTTT GAAACAGGAG ATAAGATTAA CAGTAAGCGT CTCTTGGTTG 4680 

ATACCATTAA GGAAGCCATT CCTTTTATCC TGACAGGGTC TGCCATCCAG CTCTTCCAGA 4740 

TTTTG 4745 
(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1900 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 120 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCGCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 240 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 300 

AAGGCAAGAA TACTGATTGG ATCGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 360

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 420 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCTGTTTGCC TTTAAATCGC 480 




CATTGGAAGC GGAgCTTGTC ATAAAAGGGA AACTCGATAA ACAGGACTCC CAAGCCCACA 540 

CAGAGACTGG CAAGGACGTC TGATGGGTAA TGAACTCCCA GATAGACTCT TGATACCAGC 600 

ACACTGACTA GGTAGAGGCC AAGGACGATT TGTACGATTT TTCTCCAGAC CTGATCTTTA 660 

ATCCGCTGAC TAAGAATAAC AATCAAAGTC CCTACCATCA GCGTTACAGC TAGAGAATGC 720 

CCACTTGGGA AGGAAAATCC CTTCTCCTCC ACCAGATGTA AAATAGCTGG TCGTGGGCGC 780 

TGGTAGATAT TTTTAAAGGT CACGATTAAA AGACCTGCCA AAGCCAGATT TCCCAGCATG 840 

AAGAAACTTT CTATCTTCCA TCGCTTACGA TAAAAGACAA AAGCTGTAAT GACAACCCAA 900 

GTGATAATCA CTGGGATATC AATCAGACGT GTGAGGGCTC GAAAAAGAAT AGTCAAATAA 960 

TCTGGTAAGT CTCCTCGAAT GGCAGTCTGA ATCGATTGGT CAAAATTGAC CAACATTTCA 1020 

GGGTAAAATT TGACCATGTA GCCAAGAATA ACGAAAAGTA AAAGGGCAAA ACTGCCCTTC 1080 

ATTAAAAATG TTTGTTTATC TCTCATAATG TTTTAAGGTT GGTTTCAAGA GAACATACAA 1140 

CAACCAGAAT GAAACGGAAA AGATAACACC TTCAATCAAG TTAAAAGGTA ATACCATGGT 1200 

CATTAGGTAG TTGGAAAGTC CCAAAATTTT TCCAATATCA AAGTTAGCAA ACTTAGCGTA 1260 

CAAAGGAACA GCATAAACAT AGTTGAGAAC CAACATGGCC AAGGTTAAAC CAATAGTTCC 1320 

AGCTAGAGAG CCTAGTAGGA AAOGAAGGGT TGTCCGTTCC TTTTTCCAAA TCAAAGCAAA 1380 

TACGATGACA AAAACTCCCA AAGCTACGAT ATTCATCGGC AAACCAATGT AAGTATTCAC 1440 

TCCTTGGCTG TTAAGAAGCA ATTTCAAGAG TGAGCGAAGC AAGAGCACTC CTAGAGmCsC 1500 

AGGCAAATCC ATGACCACCA GACCCACAAG GACTGGCAAG ATACTAAATT CGATCTTGAG 1560 

GAAAGATGCC GCTGGTAAAA GCGGAAAGTC AAAGTACATC AGCACAAATG AGATGGCTGA 1620 

TAGAATTGCA ATGGTCGAAA GTCGACGTGT GTTTGTCATA ACAGGTTCCT CCAATTTTCT 1680 

ATAAAATCAG AAGAAGTTGG AAAGGATTCC TCTATCTATT CTCACTTTTT ATATCCCAAA 1740 

AGTTCCCTCT TACTCTATTA AAGAAAAACA AAGCAAGTGG TTACAATCCG GCTATAAATC 1800 

TATCAAAACA GACAAGGCTA TTCTTTCGTC TTCTCCCATC CAGACTATAC TGTCGGTTGT 1860 

GGAATCTCAC CACATCACGT TGCGCTCACG GACTTCTTTA 1900 
(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4692 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

GGTTTTCCAG CAGGAGCTTC TCCTTTATCA GAATGACCAT CCCATCTCCT CACGATAGAT 60 

GAATAATGAT ATTTTTTACC ATGATAGTAA TTTGAAAAAG CCTAACCACC TCCTGAACCT 120 

TCTCCATATG TCCATACTCC TCCATCTGGA TATTATACAG CAGCTGATGC AGCTCCCAAT 180 

AATGTAAAAC TTGAAATAAG AGCTAGAGCA AGTAATCTAT GTTTTTTCGT TTTCATTTTA 240 

TTTrTTCTTT CAAAAAAAGC ACACCTTGAG CAACAATGCA ACAAAATAAA TCCTCCTCTC 300

TCTTTTATTG AAACCGCTTT CTTATGTGAT AAGAATAACT TTTTTATtAT TTGTTGTCAA 360 

GGAAAAAATC GAATTTTTTA GATATTTTAC TATATTACCT CTGTGAATAA TATTATATAG 420 

TAGTTTTATT TCAAAATAAT ATGCAACCAG TACTAACCAA ATATAAAATA GATGCCATTA 480 

ACGAATTTTA TTCAAGTTTT TCCCATTCAT ACTATACAAG TAAAAGAGAT GGTGTTAACT 540 

AAAAAGCAAT TCAAACTATT GTAAAATTCC TAGCAAAAAG AGAGCCGAAA CTCTCTTTTT 600 

TATCTTCTTT TACTTTTTTT GACTGGCATG AGTGTGATGT CTCTAACACT AAAGTAAGCT 660 

AGGATCAACA TGGCTATTGC TAGGAATATT TCTGTTGGTA ATTGAAAAAT TTTCAGAAAA 720 

GATAGAACCA ATAAAATCAA GAGTGCCACT AAAATACATA CCATAGCGAC GATATTGACA 780 

GTCCCTTTAA TGCTTTCTGG TGTCGCAAAT ACATAGAGTA GGAGCAGTAA AATTCCTAGG 840 

ACTAAATAGA CCATCTTTCT CTCTTTCTAG CTCTTATTCA GCTGATTTTT TCTTCTTGTT 900 

AGCTTTCTCA CGCTCTGCTT TGTTAAGGAT TTGTTTACGC AAACGGATAG ACTCAGGCGT 960 

TACTTCCATG TACTCATCGT CGTTCAAGAA CTCAAGAGAC TCTTCAAGTG TCAAGATACG 1020 

AGGCGTCTTG ATAACAGCTG TTTGGTCCTT AGTAGCTGAA CGAACGTTGG TCATTTGTTT 1080 

TGCCTTCGTG ATGTTAACTG TCAAGTCATT TTCACGAGAG TTTTCACCGA TGATCATTCC 1140 

TTCATAAACC TCAGTACCTG GGTTGACAAA GATCGTACCA CGTTCTTCGA TAGACATGAT 1200 

TGAGTAAGTT GTAGCCTTAC CAGCATCGAT AGAAACAAGG GCACCACGGT GACGTCCACC 1260 

AATTTCCCCT GGAATCAATG GCAAGTATTG GTCGAAGGTA TGGTTCATGA TACCGTAACC 1320 

ACGAGTCATT GATAAGAACT CAGTTGAGTA TCCAATCAAA CCACGCGCTG GAACAAGGAA 1380 

GACCAAACGA GTTTGACCAT TACCAGTTGA AATCATATCC AACATTTCAC CTTTACGTTC 1440 

AGAAAGGCTT TGGATAACAG ACCCTTGGTA TTCTTCTGGA GTGTCGATTT GTACACGTTC 1500 

AAATGGTTCA CATTTAATAC CGTCGATTTC TTTTACGATA ACTTCTGGAC GAGATACTTG 1560 

AAGTTCATAG CCCTCACGAC GCATTGTTTC GATAAGGATT GACAAGTGCA ATTCTCCACG 1620 

TCCTGAAACA GTCCATTTAT CTGGTGAATC AGTTGGGTCA ACACGAAGGG AAACGTCTGT 1680 

TTGCAATTCT GCCTGCAAGC GTTCTTCCAC CTTACGAGAA GTTACCCATT TACCTTCTTT 1740 
ACCAGCAAAT GGTGAGTTGT TGACCAAGAA AGTCATTTGA AGAGTTGGCT CATCGATGTG 1800

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG GTTTCACCGA CAAAGATGTC 1860 

TTCCATACCT GAAACGGCAA TCAAGTCACC CGCTTTGGCT TCTTGGATTT CACGACGTTC 1920 

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACGGAAGTTT TTAGTTGTAC CGTCAAGTTT 1980 

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 2040 

ACGTCCAACG AAGTCATTGT AGTCCAAAAG TGACACTTGG AACTGCAAAG GCTCATCTGA 2100 

GTTATCTACT GGAGCTGGGA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160 

TTCTTGGTCA GCTGGATCAT CTGACAATGA AGAAGTTCCG TTGATCGCTG AAGCATAAAC 2220 

CACTGGGAAA TCAAGCTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 2280 

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 2340 

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTTCCTTC 2400

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTCTCC 2460 

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTGATA CGAGTTCCGT TGTAAGCAAC 2520 

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCCATATCGT TTGAGTCCAT 2580 

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTCT GATTGTTTCA ATAATTCGTC 2640 

AACCAGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC GGATATCTTC 2700 

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 2760 

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTGCTT 2820 

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2880 

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCCATGAAG CAAGCACTCC 2940 

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AATATCGATA 3000 

CAGGACTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATGAA CACACCTATC 3060 

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AGACAAGGAA CTTCCGACCG 3120 

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTGGGC TTTCAGCTCC 3240 

TCCATCCCCA ATATCATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3300 

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAC 3360 

CCGCAAAACT AGAGATTCTA TCTGCAAGTC sCTCCCTCAG TCAAGCCTCT ATCACCATTT 3420 

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GGTTGGTGTT AAGGTGACTA 3480 
GCCTCAAAAG AATCCAATTT GGGGACTTCA CATTGAACCC AGATTTAGCA GAACGTAACT 3540

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAG ATGAGTCGAT 3600 

AAAACAAAAA AAGCTTTAAA ACTAAAGCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3660 

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AGATATTAAG AATCTTTTCT 3720 

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG GCAAGACCAA TCCTATGATA 3780 

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3840 

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3900

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3960

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 4020

AAAACCTTGT TTTTTCAAGT AAAGAGCTGC ATTTGTAACT CGTTGCGCAC GTTGGTTTTC 4080 

GTAGAGAAGG ACAGGTTTAT CTTTACGAAG GGCTGCAAGA CTAGTTTTCA ACTGACTTGA 4140 

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAATC 4200 

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 4260

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 4320 

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 4380 

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440 

GCATCGTAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500 

ATGTAAACAA TAATACCAAA TCGTTTGAGC AATTGCTGCA CATCATAGAA TGTTTTCATA 4560 

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTCACACA ATGCTCCTTA 4620 

TTTTCCTATC TTCTTTAGCG ATTCTAAGGC AAGTATGGTA CAATAAAAAC ATGGGGATTC 4680 

AACAATTACA TT 4692 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
GCTAAAAAGC TGATAATCTT CGACTCCTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60

GCCGCCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 120

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180

AAAACTACCA AAGTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240

GTCTGACGAA CGCCTACAAT AGTTGCTAAC ATACTTCCTA AAAAGAATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA GAGGTCACTA GAAAACTACC AATAAACATG 360 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 420 

CCATTTGATT TAGCTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAG 480 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT GAGGAATGAA CTGAGATTCA 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC ATAGTCTTGC CGAACCAAGA AAGTAAAAGG CATGGTCACC 660 

AAGACCATCA AAACCACAGG TGGCACAATC TCGATAAAAG CGTCTT 706 

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:
CAGCTGATCG GCAATATCAG TCATAGAAAT TTTTTCAATT AACTTTTGAG CAATTTTTTG 60 

GTTGATGATA CGAGGGATTT GGTGATTTTT CTTTACCAGG GGAGTCTCAG CAACCATCAT 120 

TTTTGAACAG TGATAGCACT TGAAACGGCG TTTTCTAAGG AGAATTCTAG AAGGCATACC 180 

AGTTGTTTCG AGGTAAGGGA TCTTAGACGG TTTTTGAAAG TCATATTTCT TCATTAGACT 240 

TCCACAATCA GGGCAAGATG GAGCCTCATA ATCCAGCTTA GCGATAATTT CTTTGTGGGT 300 

ATCCATATTG ATGATATCTA GAATCTTGAT GTTTGGGTCT TTAATATCGA GCAGTTTTGT 360 

GATAAAATGT AATTGTTCCA TATGATTCTT TCTAATGAGT TGTTTTGTCG CTTTTCATTA 420 

TAGGTCATAT GGGACTTTTT TTCTACACAA AAATAAGCTC CATAATATCC ATAGGGGATT 480 

TACCCACTAC AAATATTATA GAGCCCGAAA ATATGGGAAA ACTGATCCTT GTTTCTGCTT 540 

TTGTCTATAG AAGAATAATA AAGATTATCT TCTTCAAATT CTCCGATATT CTCTAAAGTT 600 

TTGTGCAAGT TGCACAGAAC TTGTTTATTT TTTTGGTCAT CTTGCCATAG AAATATAAAG 660 

CGTTTTCATA TATAATATAA TTATCAAAAG ACAAAAGGAG TTCACCTCAT GGTAGAATTG 720 

AATCTTAAAA ATATTTACAA AAAATATCCA AACAGCGAAC ACTATTCAGT TGAAGATTTC 780 

AACTTGAACA TCAAAGATAA AGAATTTATC GTTTTCCTAG GACCTTCAGC ATGTGGTAAA 840 
TCAACTACAC TCCGTATGAT TGCTGGTCTT GAAGACATTA CAGAAGGTAC TGCATCTATC 900
GATGGCGTAG TTGTCAACGA CGTAGCTCCA AAAGACCGTG ATATCGCCAT GGTATTCCAA 960 

AACTACGCTC TTTACCCACA CATGACTGTT TATGACAACA TGGCTTTCGG TTTGAAATTG 1020 

CGTAAATACA GCAAAGAAGA CATTAACAAA CGTGTTCAAG AAGCAGCTGA AATACTTGGA 1080 

TTGAAAGAAT TCTTGGAACG TAAACCAGCT GACCTTTCAG GTGGTCAACG TCAACGTGTT 1140 

GCCATGGGGC GTGCGATTGT CCGTGATGCG AAAGTATTCT TGATGGACGA ACCTTTGTCA 1200 

AACTTGGATG CCAAACTTCG TGTATCAATG CGTGCTGAAA TCGCTAAAAT TCACCGTCGT 1260 

ATCGGAGCTA CAACTATCTA TGTAACTCAC GACCAAACAG AAGCGATGAC ACTTGCAGAC 1320 

CGTATCGTTA TTATGTCAGC TACTAAGAAC CCTGCTCGTA CAGGTACTAT CGGACGTGTA 1380 

GAACAAATCG GTACTCCTCA AGAACTTTAC AAAAATCCAG TTAACAAATT CGTTGCAGGA 1440 

TTCATCGGAA GCCCAGCTAT GAACTTCATC ACCGTGAAAT TGGTTGGTAG CGAAATTGTT 1500 

TCTGACGGTT TCCGTTTGAA AGTGCCAGAA GGAGCATTGA AAGTTCTTCG TGAAAAAGGC 1560 

TACGAAGGAA AAGAATTGAT CTTTGGTATC CGTCCAGAAG ACGTGAATGC AGAACCTGCT 1620 

TTCCTTGAAA CATTCCCAGA CTGTGTTGTA AAAGCGACTA TCTCTGTATC AGAACTGCTT 1680 

GGTTCAGAAT CTCACCTTTA CTGTCAAGTT GG7AAAGACG AGTTTGTTGC AAAAGTTGAT 1740 

GCTCGTGACT ACTTGCAAAC AGGTGCAACA GTTGAGCTTG GATTTGACTT GAACAAAGCA 1800

CACTTCTTOG ATGTAGAAAC TGAAAAAACA ATCTACTAAA ATAAATAAAA TTCAAAGCAC 1860

TACAAGAAAA GATATCTCTT TATCAATTGT AGTGGAGAGA TATCAGTTAA TCTAGGGAGA 1920 

GAAACAAAAT GCTTCTCTCC TTTTTGCTAG AGAAGTCATA TTATGCATCT ATATTGTGAT 1980 

GCTCTTTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT GCCGTACCTA 2040 

TGATTACTGA TTTCGTCAGT TTTATCTGCA ACCTCAAAGA TGTACTTTGA GCAGCTTACG 2100 

GCTAGTTTCC TAGTTTGCTC TTTGATTTCC ATTGAGTATT ATTTGTGGGT ACCATCTACA 2160 

AGTGAAGCTA TATGCGTAAA CTACGTGAGC AATTGAATTC GAACTAGAGA CGTAATAATA 2220 

AATTTATGCT ATAGTTATGG TGACTTGTAT GCTTTTGATT CTAGTTTATC AAATAATAGA 2280 

TTAGAATTGT CAGATAATAT CATTTTGTGT TATAATGAAG AAAAAACAGA GGTGTTCAAA 2340 

TGTCAGAAGC AGGTCATAAG TTTTTAGCAA AATTGGGGAA AAAACGCTTA CGTCCAGGTG 2400 

GAAAGCGTGC CACAGATTGG TTAATTGCAG AAGGAGGATT TTCAAAAGAA AAGAGAATAC 2460 

TAGAGGTTGC GTGTAATAGG GGAACTACAG CAATTGAGTT GGCACAGCGT TTTGGTTGCA 2520 

AGATAACTGC TGTTGATATG GATGCTCAAG CTTTAGAAGT GGCTAAAAAA TCTGCTGGAA 2580 

CGGCAGGTGT TGCTCATTTA ATCAGTTTTG AAAGAGCAAA TGCAATGAAA CTTCCTTATC 2640 
AAGATGCTAG TTTTGATATT GTTATAAATG AAGCTATCCT GACTATGCAA GCCGATCAAG 2700

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGGT CTTCTCTTGA 2760 

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2820 

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGGGA ACAGGTGATG ATAGAATCAG 2880 

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 2940 

TTTATGACGA AGGTTTGCTA GGAACTTTGA AAATTTGTGT AAATGCTTGT AAAAAGGAGA 3000 

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3060 

TTATTGCGAT GGCTAGTTAT AAATCGTCAA AACGTTAGAT AATTATTCAA GTTAACTTTT 3120 

CCTTTTTTCT TTCTTAAAAA ATATGCTATA ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180 

AAAGATGAAT TTAAAAGATT ACATTGCAAC AATTGAAAAT TATCCAAAGG GTACCG 3236 
(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2885 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

CCTGACTTTT CAAATTGGTT AGTTTGCCAC ACTTGGTTTA TATGGTCGTG GAAAGCATGG 60 

CTATTACTTC TCAAAGGGCG ATTTCTCACC CCATGAAAAG TGTCTATTTT TGTTTAGGTT 120 

TGTAAGTTAA TTCATTGTCA CATATTACTC TTTAACTGAT TGAGTGAGTA CCGCTTATAT 180 

TTGATCCCAA ACGCCTTAAA AGTGTTACCC TCAAGTCCTT TTAGAATACG GCTATAATTC 240 

CGCTCATTGT AAACTATCTT AAGCTCATCA CTATCTAGGT TGGTATTAAA AATGGTATTT 300 

TCACGATTGT TTAGCACGTC AAAGAGTAAA TCCTGCTCCC AGTCACTCTT AGGCTTAATA 360 

ACAGCATTTT TTGCTCCTAA ATCATCAATA ATTAAGTAAT CAACAGACTT CATGAGTTCA 420 

GTAGCTTCAA ACTCTGTAAG TGTTGCACCT TTACCATAAT TCCACCCCTC TTTAATTTGT 480 

TTGATCATTT CGGTTAGGCT TACAAAAAGC ACACTCTTAG GTTCTCCTTT TGTCTTATAC 540 

CCCTCATTTA TACCTTTGGC AATAGCAACT GATAAAAGTG TTTTTCCAAT CCCTGTACCT 600 

CCTGTGATAA GCGTATTTCC CCTCATGCCA TCAAGATATT TTTGTACCTG ACCTTTTGCA 660 

AATTCTAAAA ATCGCTTTTC TTCTGATGTT ACAGCATTAA AATCATCAAA AGTTTTAGTT 720

TTAAACTCAT CTGCTACATA GCTCTTATTG CTCATCAACA CATTATAAGT TTGCATATAT 780 
AGTTTAGCAT TCAAATTATC AGCAATCGCA TCTTCTTCAT CTTGCTTTTT CTGTTCTTCT 840

TGGCATTGTT CACAATAGGG TGGGATACAG CGAACTTCTT TTATTGCCTC TCCGTTCTCA 900 

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTCTG TTAGCTGTAT TTCATGCTTA 960 

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020 

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080 

GCTCTAGTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140 

GTTCCACCGT CAATAAATTG TCCTGTTTCC AACGGTTAAG CATTACCTTG ATGTATGCAA 1200

AGTTTGCTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 1260 

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320 

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTCCACACT GGTTACTTTT TTATTTACAA 1380 

AATCAGAAAT CAGTTGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440 

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500 

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAATATC 1560 

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTG 1620 

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680 

GTCCATTTCA AAAATACCAG GGGCCAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 1740 

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTCATC 1800 

TTTGATGGAT TTTGTCTGAC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT 1860 

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 1920 

TTCTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 1980 

TTCTAAACAT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 2040 

CCATATCCTC AAATGCCTGG TACTGCTCCA ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100 

TATAGCCCCC ACGCTCTTCT CTTAACTGCT TAGCGTTCAT CTCTGTTACT GCCTTTAGTA 2160

GCAAGTTGTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATGTT TCTATACTGT 2220 

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG CAGACGAATT TCAGAAAGTT 2280 

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340 

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCTGTTGCTC GTTCAGAATA TAGGATTTTT 2400 

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 2460 

AGTCAGCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 2520 

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580 
TTACGGTTCT ACCTCCTGTA TAAATCTGGT TAGCTTACTT TTTAATTGCC TCCTCTAGCC 2640
TCTTTTTTAC CCTCTAAAAC GGCTTTGGCT AGTGGTTAAT ATTATTTACC ACTTGTCTCT 2700 
ATAAACGTGT TAGAGCCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGTGGAA 2760 
TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTGTTTAA CCCATTTCTT GAAAGAAATA 2820 
AAATTACATC TTCTTTATCC TTGGCATCTG CTTTGTCTGA GACAAATTAG AATGTCAATA 2880 
CTTGG 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:
TATCAATCCT TTCCCATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 60
ATGTATTTTT ACGAGTCAGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTTTTGAC 120
TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAC 180
ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTA CTATAAAATG 240
AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA AAAACATTAA 300
AACGTCCTGA GGTTTTATCA CCTGCAGGGA CTTTAGAGAA GCTAAAGGTA GCTGTTCAGT 360
ATGGAGCAGA TGCTGTCTTT ATCGCTGGTC AGGCCTATCG TCTTCGTAGC CGTGCGGGAA 420
ACTTTACTTT CGAACAGATG GAAGAAGGCG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 480
TCTATGTAGC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 540
TCCGTAAACT GCGTGATATC GGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 600
TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACCCAA GCCAGTGCCA 660
CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 720
GTGAGGTTTC AATGGAAGAA TTAGCTGAGA TCCGCAAACG TACAGATGTT GAAATTGAAG 780
CCTTTGTCCA TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 840
TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACGACC 900
TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 960
TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG ATTGAAAATG 1020
GTGTGGACAG TCTAAAAATC GAAGGACGTA TGrAGTCTAT TCACTAyGTA TCAACAGTAA 1080

CCAACTGCTA CAAGGCGGCT GTGGATGCCT ATCTTGAAAG TCCTGAAAAG TTTGAAGCTA 1140 

TCAAACAAGA CTTGGTGGAC GAGATGTGGA AGGTTGCCCA ACGTGAACTG GCTACAGGAT 1200 

TTTACTATGG TACACCATCT GAAAATGAGC ACTTCTTTGG TGCTCGTCGT AAAATCCCTG 1260 

AGTACAAGTT TGTCGCTGAA CJl' G GTTTCTT ATGATGATGC GGCACAAACA GCAACTATTC 1320 

GTCAACGAAA CGTCATTAAC GAAGGGGACC AAGTTGAGTT TTATGGTCCA GGTTTCCGTC 1380

ATTTTGAAAC CTATATTGAA GATTTGCATG ATGCTAAAGG CAATAAAATC GACCGCGCTC 1440 

CAAATCCAAT GGAACTATTG ACTATTAAAG TCCCACAACC TGTTCAATCA GGAGACATGG 1500 

TTCGAGCTCT TAAAGAGGGG CTTATCAATC TTTATAAGGA AGATGGAACC AGCGTCACAG 1560 

TTCGTGCTTA ATGTAGTTGT TTAGTTTTAA AAAACTATGC AAAGCTCCAT ATACAACACT 1620 

TAAACGAGAT TAAAGAATGG CGAAATCCCT TGATGCGCAA GAGATTAGCT GTCTTTTTTA 1680 

TTTTTTAAGT GATAAAGTCG GAGTTTAGGC ATCAAAGCCT ATCAAATTAA ACAAAGAAGC 1740 

GATGTCTTAG ATATTTTGAA AAAAATTAAT AAGCAGAAAA CTCTCTATTA TTTTGTTGTA 1800 

GAGAGTTTTT TGTTAATAAA ATTTCACAAA ATGACATTTA TATATTGCAT TAAGTTAGAT 1860 

ATATGATATA ATATTGTTAA AAAGAGGCCC AACTTTTTAA AATTAATGAG AATCAAAGAG 1920 

AAAACCAATA ATATTAATGG AGGAATAAAA AATGTAAGTA AGCATTATGG TCATTCAATC 1980 

ATTCTCAAAG ATATAAATTT TGCACTTAAC AAGGGTGAAA TTGTTGGTCT AGCAGGGAGA 2040 

AATGGAGTTG GTAAGAGTAC GTTGATGAAA ATTCTTGTTC AGAATAATCA ACCGACTTCA 2100 

GGTAATATTA TAAGCAGTGA TAATGTTGCG TATTTAATCG AAGAACCAAA ATTATTTTTA 2160 

TCTAAAACAG GTTTAGAGAA TTTAAAATAT TTGTCAAATT TATATGGTGT TGACTACAAT 2220 

CAAGAAAGAT TTAGATGTTT GATCCAAGAG TTAGATTTGA CTCAGTCTAT TAATAAAAAA 2280 

GTAAAGACCT ATTCTTTGGG TACAAAACAA AAATTAGCTT TGCTTCTAAC TCTCGTTACG 2340 

GAACCTGATA TATTGATTTT AGATGAACCG ACTAATGGTT TAGATATTQA ATCATCACAA 2400 

ATAGTTTTAG CGGTTCTAAA AAAATTAGCT TTACATGAAA ATGTGGGAAT TTTAATATCG 2460 

AGTCATAAAT TAGAAGACAT TGAAGAAATT TGTGAGAGAG TTCTTTTCTr GGAGAACGGG 2520 

CTTTTGACAT TTCAAAAAGT AGGAAAAGAT AGTCATAATT TCTTGTTTGA GATAGCTTTT 2580 

TCATCAGCTA CAGATAGAGA CATTTTCATT ACCAAACAAG AATTTTGGGA TATTGTTTAG 2640 

GAAGAGGGAT TGAGAATTAC TATGTCTGGG AATATTCAAA ATAGTGAGCT TTTTAAATTT 2700 

TTTAACGAAA ACTCTATTAA AGTAGTTCAT TTTGAAACTA AAAAAGAGAC GCTTAAAGAT 2760 

ATTTACCTAA ATCGTTCAAA ATAAAGGAAG GTTATAATCA TGAAATTAAA TAAACAGAAG 2820 
AATCGGATGA TTTACGTCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 2880

GCTTTGAATG GCATTGTGTT ACTAGTCATA GTAAGTAAAT TGGGTATTCC AGGTGATTTA 2940 

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 3000 

CTATTATCTT ACATTTTCTA TTTATACAAA TTGAAAAGTG GCTTGGTATw TGGTATTTTA 3060 

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATACGA TGATGATGAA TACTAGTAAT 3120

GATTTGTTTA TCAAAGCAAT TGAA 3144
(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

TACGGTATTA TTTTTAAGCA GAAAGAATCA TGAAAATCAA AAAATGGCTT GGTCTAGCAG 60 

CCCTTGCTAC AGTCGCAGGT TTGGCTCTTG CAGCTTGCGG AAACTCAGAA AAGAAAGCAG 120 

ACAATGCAAC AACTATCAAA ATCGCAACTG TTAACCGTAG CGGTTCTGAA GAAAAACGTT 180 

GGGACAAAAT CCAAGAATTG GTTAAAAAAG ACGGAATTAC CTTGGAATTT ACAGAGTTCA 240 

CAGACTACTC ACAACCAAAC AAAGCAACTG CTGATGGCGA AGTAGATTTG AACGCTTTCC 300 

AACACTATAA CTTCTTGAAC AACTGGAACA AAGAAAACGG AAAAGACCTT GTAGCGATTG 360 

CAGATACTTA CATCTCTCCA ATCCGCCTTT ACTCAGGTTT GAATGGAAGT GCCAACAAGT 420 

ACACTAAAGT AGAAGACATC CCAGCAAACG GAGAAATCGC TGTACCGAAT GACGCTACAA 480 

ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CAGCTGGCTT GATTAAATTG GATGTTTCTG 540 

GAACTGCTCT TGCAACAGTT GCCAACATCA AAGAAAATCC AAAGAACTTG AAAATCACTG 600 

AATTGGACGC TAGCCAAACA GCTCGTTCAT TGTCATCAGT TGACGCTGCC GTTGTAAACA 660 

ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC ACTTTTCAAA GAACAAGCTG 720 

ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA AAAAGATTGG GAAACATCAC 780 

CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA CACAGATGAC GTGAAAAAAG 840 

TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG GTAATAAGAA ACAGGGAGGT 900 

GGGAGAGAAA ATTCCACCTC TTGCTTTTGT ATAGAGTATA GATTGTAAAG AAGACTATTC 960 

GTTCATAGAA AGGTAGAGAG AATATGGTTT TTCCTAGCGA ACAAGAACAC ATTGAAAAAT 1020 
TTGAAAAGGA TCATGTAGCC CACCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 1080

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140 

TCAAGCGTCT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCGCCC TTTGTCATGG 1200 

CACATTTCAA GAGTTCGCGT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 1260 

CTGTGCCAGC GGATGCGGAT CAGGTCTGGA CAGAGGATCC kTTTACCCTT TCGGTCCGCA 1320 

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 1380 

GTGCTTTGAG AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440 

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AAGCATGCAG 1500 

ACAAACTCCG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG GACCAAAAAT GCCTTGGAAC 1560 

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 1620 

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 1680 

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGGTTGAA GGCTTGTACG 1740 

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800 

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 1860 

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 1920 

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980 

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 2040 

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100 

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160 

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTTGATG 2220 

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280 

GAGATGAAAA TGTGCGAATC GCTGATTATT ACACCCATAT CGAATTAGTA GAGGAGCTGA 2340 

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 2400 

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 2460 

TACGGAATCG TTGGATATTC TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 2520 

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTGACGACG ATGTGATTTT TGACGGCAAG 2580 

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG 2640

CATTTTAACC TGATGAGCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700 

TCTGAACTCA GCAAGGAAGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 2760 

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAGCGTGTG 2820 
GCAATTGCCC GTGCCTTGGC CAATGATCCA AAAATCTTGA TTTCACACGA GTCAACTTCT 2880

GCCCTTGATC CGAAGACAAC CAAGCAGATT TTGGCCTTGT TGCAAGATTT GAACCAAAAA 2940 

TTAGGCTTGA CTGTTGTCTT GATTACGCAT GAAATGCACA TTGTCAAAGA CATTGCCAAC 3000 

CGTGTTGCAG TTATGCAGGA TGGGCATTTG ATTGAAGAGC GTAGTGTGCT TGAAATCTTC 3060 

TCAAACCCTA AACAACCTTT GACTCAAGAC TTTATCTCAA CAGCTACAGG TATTGACGAA 3120 

GCCATGGTCA AAATCGAGAA GCAAGAAATC GTGGAACACT TGTCTGAAAA CAGTCTCTTG 3180 

GTGCAACTCA AGTACGCTGG AGCTTCAACA GACGAGCCAC TTTTGAATGA ATTGTACAAG 3240 

CATTACCAAG TAATGGCTAA TATTCTCTAT GGGAATATCG AAATTCTCGA TGGTACTCCT 3300 

GTTGGAGAAT TGGTGGTGGT TTTGTCAGGT GAAAAAGCAG CGTTGGCAGG TGCCCAAGAA 3360 

GCCATTCGTC AAGCAGGTGT ACAACTAAAA GTATTGAAGC GAGTACAGTA AGATGGAATC 3420 

ATTGATTCAA ACCTATTTAC CAAATGTCTA TAAGATGGGT TGGGCTGGTC AGGCAGGCTG 3480 

GGGAACGGCT ATCTACTTAA CTCTTTATAT GACAGTTCTT TCCTTCATTA TCGGAGGCTT 3540 

CTTGGGGCTA GTGGCAGGTC TCTTTCTCGT CTTGACAGCG CCAGGTGGTG TCTTGGAGAA 3600 

TAAAGTCGTA TTCTGGATTT TAGACAAAAT TACCTCAATT TTTCGTGCGG TTCCCTTTAT 3660 

CATCCTCTTG GCAATCTTGT CACCACTTTC TCACTTGATT GTTAAAACAA GTATCGGGCC 3720 

AAATGCAGCC CTTGTCCCAC TTTCTTTTGC AGTCTTTGCC TTCTGG 3766 

(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

TGTTGCTGAG TTAATCGGTA CGTTCATGTT TGTATTCGTC CGGACAGGAG CTGTTGTTTT 60 

TGGAAATGGT CTTGATCGCC TTGGTCACCT TGGAATCGCC TTTGCCTTTG GTTTGGCAAT 120 

CGTGGTGGCA GCCTACTCAA TCGGAACTGT TTCAGGTGCT CACTTGAACC CGGCTGTTTC 180 

GATTGCTATG TTTGTAAACA AACGTTTGTC ATCTTCAGAA CTTGTAAACT ACATCCTTGG 240 

TCAGGTTGTT GGAGCTTTCA TCGCTTCTGG CGCTGTCTTC TTCCTCTTGG CTAACTCAGG 300 

TATGTCAACT GCTAGTCTTG GTGAAAATGC CTTGGCAAAC GGTGTCACTG TCTTTGGTGG 360 

TTTCTTGTTT GAAGTCATCG CAACTTTCTT GTTTGTATTG GTTATCATGA CTGTGACTTC 420 
AGAAAGCAAG GGCAATGCCG CGATTGCTGG TTTGGTAATC GCTTTGTCAT TGATGGCGAT 480

GATTCTTGTC GGATTGAAGA TTACTGGACT TTCAGTAAAC CCAGCTCGTA GCTTGGCACC 540

AGCTGTCTTG GTAGGCGGCG CAsCCTTCAA CAAGTTTGGA TTTTCATCCT TGCACCAATC 600 

GCTGGTGGAG TTCTTGCAGC CCTTGTTGCA AAAAATTTCC TTGGAACAGA AGAATAATTG 660 

AAACTCAAAA AGCCTTGCTC CTCATCTTGA GGAACAGGGC TTTTTCGTAT GATACTCTTC 720 

GAAAATCTCT TCAAACCACG TCAGCTTCAT CTTGCCGTAG TATGGTTACT GACTTCGTCA 780 

GTTCTATCCA CAACCTCAAA ACAGTGTTTT GATCTGACTT CGTCAGTTCT ATCTGCAACC 840 

TCAAAACAGT GTTTTAAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA ACAGTGTTTT 900 

AAGCTGACTT CGTCAGTTCT ATCTGCAACC TCAAAACAGT GTTTTAAGCT GACTTCGTCA 960

GTTCTATCTG CAACCTCAAA ACAGTGTTTT AAGCTGACTT CGTCAGTTCT ATCCACAACC 1020 

TCAAAACAGT GTTTTGATCT GACTTCGTCA GTTCTATCCA CAACCTCAAA ACAGTGTTTT 1080 

GATCTGACTT CGTCAGTTCT ATCCACAACC TCAAAACAGT GCTTTGAGCA ACcTGCGGCT 1140 

AACTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATGACT TTAGCGGTTG TCAATTTTCT 1200 

CTGGATAAAG GTCGTGTTGG AAGAGGCGTT GTTCTGCCAA GCCCTCATAC TTAGTTCCTT 1260 

GCTTACCGTA GTTGTAGTAG GGGTCGATTG AAATGCCACC GCGCGGAGTG AATTTTCCCC 1320 

AGACTTCTAA ATAGCGAGGG TCTAGCAAGT TGACCAAGTC TTTCCCGATG GTGTTGATAC 1380 

AGTTTTCGTG GAAATCTCCG TGGTTTCGGT AGCTAAATAG ATATAGTTTG AGGGATTTTG 1440 

ACTCGACACA GAGCTTGTCA GGAATGTAGG AAATATGAAT CGTCGCAAAG TCTGGCTGAG 1500 

CAGTGATTTG TCCCAGCAGA GACATATCGA GGATATGGTG ACGAATGCCC TGTTCCTTAG 1560 

CGATTTCTCT AGTAATTTGA ATTTCGAGGT GATGACGTTG GCCGTAGGCA AAGGTGACAG 1620 

CTTCGACTGT TTCATAGTGT TGCATGACCC AGAAAAGGCA GGTTGTTGAA TCTTGACCAC 1680 

CACTAAAGAC GACCAAGGCT AATTGACGTT TCATAGTACT CCTTCCAAAA TGGGAAATGT 1740 

TCAGAGCACG CAAAAAGCTC CCATTAGGGA GCTAAAAAAT ACCAAATCGA GGTTTTTTTA 1800 

GCGATGGCAT ATCCCAAACA TCGTAATATT CTACTTATAT AGTAAAATGA AATAAGAACA 1860 

GGACAAATCG ATCAGGACAG TCAAATCGAT TTCTAACAAT GTTTTAGAAG TAGAGGTGTA 1920 

CTATTCTAGT TTCAATCTAC TATAGTCTAG CATATTTTTT GAAAAATGGC AAAGGGCAAG 1980 

AAAAAAGAGA CCAAAGAAAG TACTTGGTCT CTCGTTTGAT TAGCTCAATT CAGCAATGAT 2040 

GGCCTTGATT TGTTCTGCTG TGTGAACACC TGCAACTTGT TTGACAACTT GGCCGTCTTT 2100 

TTTGAAGAGA AGAGTTGGAA TAGACATGAT TCCAAAAGCA CGAGCTGTGT TTGGATTTTC 2160 

ATCAACGTCC ATTTTAACGA TTTTCAAGAC ATCTTCTGAA AGTTCTTCAG ACAATTTGTC 2220 
CAAGATTGGA CCTTGCATAC GACATGGACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC 2280

CAAACCGTCT TTTGTTTCTT GTTCGAATGT TGCATCTGTA ATTGCTTTTG CCATTGTATT 2340 

TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GAGATAGAAG AAGATATCTC 2400 

CATAAGTCCC ATGGTAGTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACAGGGTAGT 2460 

AkJtCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 2520 

(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5278 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

ACTCAGTTAG ATTTTGTTTT CAAAAACAAC GAAGAAAAAG ACCATGTTGC TCTACTTGGA 60 

AGAATTGGCT CCGAACGTGT TTATCGATAT ATTAATAAAA AATATTTAGA TTTACCGGAA 120 

ACATTCGAAA ATTATAATGT TTTTGTACCA GAAGCTAATG CAAGTGGTGC CTTAGGTGAA 180 

GTCTTATCAA CACCCCTAAT CGGGGAACCC CTAATCGGGC ATACAGATAC TTTTTTATCT 240 

ATTGGTAATT TTAAAACAAA ATTTGAAGCC GATGCTTGTA TTAAATTTAT TAAAACTAAA 300 

TTCGCTAGAG TATTATTAGG TGTTTTGAAA GTTACTCAGC ATAATTCACG CAAAACTTGG 360 

TATTACGTCC CCCTCCAAGA CTTTACGGTC AATTCGGACA TTGATTGGAC ACAATCAGTG 420 

ACTGATATTG ACCGCCAGCT TGATCAAAAA TATGACTTTT CCCCTGAAGA AATTGCCTTT 480 

ATTGAGAATC ATGTAAGGGA GATGGATTAG AAAACTATTT TTATTTGACA AATAGTGCTC 540 

AATGATCTAA AATGACTATA TAGGATTAGG TCAGGAAGCA TACGATGCCC TGACCCTTTT 600 

TGTACTTATG AGATGAGAAA GTCATTTGTT AGATAAATTG ACTCGTTAGC AAACGTTCAA 660 

AAAAGGAAAA CTTATGCCAG TAGAAATTAA AACCACTAAA GAAATTCATC CTAAAATCTA 720 

TCCCTACACC ACACCGACAG TAACCAGTAA TGAAGGCTGG ATTAAGATTG GGTATACAGA 780 

ACCTGATGTC ACACAACGTA TCAAGGAGCA AACGCATACA GCTCATATAG CTACAGATGT 840 

CTTATGGACT GGTGATGCAG CTTATACAGA AGAGCCTGAT AAGGGGAAAA CTTTCAAGGA 900 

CCATGATTTC CACCATTTCC TTTCTTTCCA TGATGTAGAA CGTCGTCCCA AGACGGAATG 960 

GTTCTATTTT AATGGAACTC CTGAAAAATC AAAAAATCTT TTTGATAAGT TTGTTCAGCA 1020 

TGATTTGTCT GGTTATCAGC CTGGAAAAGG ACAGGACTAT ACTCTGCGAC AAGAGCAAGA 1080 
AGAAGCAGTT GCTAAGACAT TAGCTTATTT CCAAGAACAT CCTGGAGGCA AGTTTCTCTG 1140

GAATGCCAAG CCACGCTTTG GTAAAACCTT GTCTACCTAT GACCTAGCTC GACGGATGGA 1200

AGCTGTCAAT GTCCTAATTG TAACAAACCG CCCTGCCATT GCTAACTCAT GGTATGATGA 1260

TTTTGAAACA TTCATAGCAG GTCAAACGAC TTACAAGTTT GTTTCTGAAT CAGATAGCCT 1320

TAAGAGTCGT CCAATCTTGT CACGACAAGA ATTTCTTGGT ATTTTAGCTG ACGATGTAAG 1380

ACAACTTGCT TTTATCAGTC TCCAAGACTT GAAAGGATCT GTTTATTTAG GTGGAGAGCA 1440

CGATAAACTC AAATGGGTAA CTGATCTGCA TTGGGACTTG TTGGTTATTG ACGAGGCTCA 1500

TGAAGGAGTT GATACCTTCA AGACTGACCA AGCCTTTAAT AAGATTCGAC GAAATTTTAC 1560

TCTGCATTTG TCAGGTACAT CATTTAAAGC ATTGGCTAAA GGAGATTTTA CAGAGGAACA 1620

AATCTACAAC TGGTCTTATG CTGATGAGCA GGCTGCTAAG TATTCGTGGT CTCTTGAGCA 1680

AGAAGAGGAA AATCCTTATG AAAGCTTGCC TCAGTTGAAT CTCTTTACCT ATCAAATGTC 1740

TCAGATGATT GGCGAAAAGT TAGAAAAAGG CGCTCAGATC GATGGTGAAA ATATTGACTA 1800

TGTTTTTGAC TTAAGTGAAT TTTTCGCTAC AGATGATAAA GGGAAATTTA TTCATGAGCA 1860

TGATGTCAGA AATTGGTTAG ATACTCTATC AAGCAATGAA AAATATCCAT TTTCAACCAA 1920

AGAACTCCGT AATGAACTCA AGCATACTTT TTGGCTTTTA GAACGTGTCG CTTCGGCCAA 1980

AGCATTAAAA GCCCTACTAG AAGAACACCC AATCTATGAA AACTATGAGA TCGTTCTAGC 2040

TGCTGGTGAC GGACGTATGT CCGAAGAAGA CGATAAAGTC AAACTCAAAT CCTTGGACTT 2100

GGTTAGAAAA GCGATAGCAG AGAATGACAA AACCATTACC CTATCCGTTG GTCAGCTGAC 2160

GACAGGTGTC ACTATCCCTG AATGGACAGG TGTATTGATG TTATCAAATT TGAAATCACC 2220

AGCTCTTTAT ATGCAGGCCG CCTTCCGTGC TCAAAATCCT TACTCATGGA GCGATAACAA 2280

AGGAAATCAC TTTCGCAAAG AAAGAGCCTA TGTATTTGAC TTTGCGCCGG AAAGAACCTT 2340

GATTCTCTTT GATGAGTTTG CCAACAACTT ATTGCTTGTA ACTGCAGCTG GTAGAGGAAC 2400

A l A 1 1 AOAbA All A 1 I AAA (J TTCTTTCCAA TTATTGCCtiA 2460

AGACCGTGCT GGTAAGATGG TTGAAATTGA TGCAAAGGCA GTTCTAACCA CTCCTCGCCA 2520

GATAAAAGCT AGAGAAGTTC TTAAACGAGG TTTTATGTCC AATCTCTTAT TTGATAATAT 2580

TAGTGGTATT TTCCAAGCAA GTCAAACAGT TTTAGATATT TTAAATGAGC TCCCAGTTGA 2640

AAAGGAAGGG AAGGTACAAG ATAGTTCTGA TTTATTAGAT TTTTCAGATG TTACAGTCGA 2700

TGATGAGGGA AATGCAGTAG TAGACCATGA AATTGTAGTT AATCAGCAAA TGCGACTTTT 2760

TGGTGAAAAA GTTTATGGAC TTGGTGAATC TGTTGCTGAG TTAGTCACAA AAGATGAGGA 2820

ACGAACTCAA AAACAGCTGG TCAATGACTT GAGTAAGACC GTTTCTTCAG TGATTGTAGA 2880
GGAATTGAAA GCAGATTATT CTCTAAAAAC AAGGGAAACT GAGCAAATTA AGAAACAAAT 2940

TACAGCAACA CTTGAGAATG AAATTCGAAA AAATGATATC GAAAGAAAAA TTTCTGAAGC 3000 

TCATATCAAG CAAGAGTTGC AACAGCAGCT CAAAGAAGCA AATGATAAAG CGCAAAAAGA 3060 

TAAGATTCAA GAAGATTTGG AAAAACGTTT AGAAGAAAAT AAACTCATTC ATAAAGAAAA 3120 

ACTAGAACAA ACACTCAAAA AAGAAGTGGA AAAAATGCCT GAGAAATTTA TCGAACAGGT 3180 

TGAGATAAAA CGTGTGGAAC AGTTGAAACA ATCAGCTCAA GATGAAATTC GTGACCATTT 3240 

ACGAGGGTTT GCAAGAACAA TTCCAAGTTT TATTATGGCT TACGGTGATC AAACTCTAAC 3300 

ACTTGATAAT TTTGATGCCT TTGTTCCTGA ACATGTTTTT TATGAAGTAA CAGGGATTAC 3360 

GATTGATCAG TTTAGATATT TGCGAGATGG TGGGCAGGAT TTTGCAGGGC ATCTCTTTGA 3420 

TAAAGCAACA TTTGACGAAG CTATTCAAGA ATTTCTTCGC AAGAAAAAGG AGTTGGCGGA 3480 

TTATTTTAAA GATCAAAAAG AAGACATTTT TGACTATATT CCACCGCAGA AGACCAACCA 3540

AATTTTCACT CCTAAACGAG TGGTGAAAAG GATGGTAGAT GATTTGGAAA AGGAAAATCC 3600 

AGGGATTTTT GATGATCCAT CTAAGACTTT TATTGATTTA TATATGAAGT CAGGCCTCTA 3660 

TATTGCAGAA CTTGTGAAGC GGTTATATAA TAGCAATGGC TTGAAAGAGG CCTTTCCAAA 3720 

TCCTGAAGAA CGCTTAAAAC ATATTTTGGA AAAGCAAGTT TATGGATTTG CTCCGTCTGA 3780 

GATTATCTAT AACATTTCCA CTAATTTTAT ATTTGGCAAT CTTTCTAAAG ATATCAGTAG 3840 

GAAGAATTTT GTTTTAGCAG ATACCATTCC AGCGGCTAAA GAAGGGAGCA TTCAAAAGTT 3900 

GGTTGATTCC TATTTTGAAA ATAATTAAAA AGAAGGCCGA GTCAAAATTC TTTGAAATCA 3960 

GAAAAAACGC ATAATATTGA GTGCTTTTGT ACTGCCCCCC AAAAGTTAGA CAGAAAAAAT 4020 

CTAACTTTTG GGGGGCAGTT CAGACAATCC TTGGTATTAT GCGTTTTATT GTGGGAAGAT 4080 

GTATAATGGA TTGAAATAAG ATATGAACAA ATCAATTAGG AATTTAAAGC ATTTTATAAC 4140 

AACGTTTTAG AGTAATGGGG GGCTATTTCA ACTTCAACCT ACTATAATAC AGAAAAAAAC 4200 

AACTCCCTGA TAATTCAAGG AGTTGTCTAT AGTTAAATTA GTTTTTAGAA GCTTCTTGGA 4260 

ATTCTGGGTT TTTCCATGCT TCGTCAATGA TAGCTTGTAA TTCTTTAGCA GATGCTTGCA 4320 

TTTTTTGAGT TTCTGCGTCG TTCAATGGGA TATTTACTGG ACGAACGATA CCATGTGCAC 4380 

CAACAACAGC TGGTTGACCG ATAAAGACAT TCTCAACTCC GTATTGACCT TCTTGGAATA 4440 

CTGAAAGTGG AAGTACTGCG TTTTCATCGT CAAGGATTGC TTTAGTGATA CGAGCAAGGC 4500 

CTACTGCGAT ACCGTAGTAT GTTGCACCTT TTTTGTTGAT GATTGTGTAG GCTGCATCAC 4560 

GAACACCTTC GAACAATTCA ATCAATTCAG CTTCTTGAAC ATTTTGAGTG TCTTTAAGGA 4620 
ATTCTTCAAG GTTTACACCA GCGATGTTAG CGTGTGACCA AACAGCGAAC TCAGAGTCAC 4680

CGTGTTCACC CATGATGTAG GCGTGCACTG AACGAGCATC CACATCCAAT TTTTCAGCAA 4740 

GTGCTTGACG GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG CGTTCTTTAG 4800 

GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA GCAGCAACAA 4860 

GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GftTTGATTTG TTGATAGCAA 4920 

GGTTTTTACC TACAACGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA CCTGCAGTGA 4980 

TCACAACAAG GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT TTTTTAGGTG 5040 

AAGTGAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT TCATGCAATT 5100 

GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT GCAAAAGCGT 5160 

AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT TTAGTTGAAG 5220 

TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA ACTTCATT 5278 

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:
ATAAGGAATC TCTAAAAAAT TTTAAGGAGA ATCTAGCAAA TGGATTTCAC ATGGGCACTG 60
AAGTATGCCA CTGAATTTTT GGGAACTGCC ATTTTGATCA TTCTTGGGAA TGGTGCACTT 120
GCCAACGTTG AACTTAAAGG TACGAAAGGT CACCAAAGTG GCTGGATCGT CATCGCTGTT 180
GGTTATGGTA TGGGGGTTAT GATCCCAGCC TTGATGTTTG GTAACGTATC TGGGAATCAC 240
ATCAACCCTG CTTTCACTCT AGGGCTTGCA GTTAGCGGTC TTTTCCCTTG GGCACAAGTG 300
GTACCTTACA TTATCGCGCA AGTCTTGGGG GCTATCTTTG GCCAAGCCTT AGTTGTGGCA 360
ACATACCGTC CATTCTACTT GAAAACTGAA AACCCAAATA ACATCTTGGG AACTTTCTCA 420
ACTATTTCAA GTATTGACCA TGGTACAAAA GAAAGTCGCT ATGCAGCAAC TGTCAATGGT 480
TTGATTAATG AGTTTGTTGG TTCATTTGTT TTGTTCTTTG CAGCTCTTGG TTTGACTAAA 540
AACTTCTTTG GTGCTGAAGT GCTTCAATTC ATGAAACAAA AGGCAACAGA AGCAGGACAA 600
ACAGTTGATT TTTCTGACTT GGCTATTAAA GCACAGGTGG CTCCACACAC TGCTTCAGGA 660
CTTTCTGTGG CTCACTTGGC ACTTGGATTC CTCGTTATGG CTTTGGTAAC ATCACTTGGA 720
GGACCTACAG GACCTGCCTT GAACCCAGCC CGTGACTTGG GACCACGTCT CCTTCATGCT 780
TTCCTTCCCA AATCACTTCT TGGTGAGCAT AAAGGCGATT CAAAATGGTG GTATTCTTGG 840

GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG TAGCTGTATT CAAATTCCTT 900 

TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC CATCTATAAG TAAGAGAGGA 960 

TCAGACTGGk TCTCTCTTTT kGATTTTTaG GGAAATGAAA GAAcTCTAAA CAAACTCCTC 1020 

TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCCAGCT TCAATGGACT ATAGTAGGTT 1080 

GCAGTTGAAA TAATAGACCC TTGTTTCTAA AACATTGTGA GAAATTGGTT TGAATTCTCC 1140 

AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCGG AATATTATCG GAGATGGGTT 1200 

CCCTATCTTG TAAGTCTGCT TTATAGTGGG TTGAAGTTGG AATAGTCCTC CCTTCTTTCT 1260 

CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAA? GTTCAGTTTC TATTTCATTT 1320 

TACTATAAAA TAAGCGATTA GGGGGGCTAT TCTTCGACCT ACATTGACTC TGCTGAGTCC 1380 

TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG AAAATCAAAG GGCAAACTAA 1440 

GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT GTATAGTAGA TTGAAACTAG 1500 

AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA TTTGACTGTC CTGAACGATT 1560 

TGCCCTATTC TTGTTTCATT TTACTATATA AACCAGAGAC TGTTTACATT TTCAGCAAGT 1620 

GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT CTATTTAGTA CTTTCTATTA 1680

ATTAGTTAAA TTTTTACCAA GAATAATTCA CAAAAACGTT GTAAAACACT TGCAATTTAG 1740 

CTGAAATTTG ATAAAATAGT AAGGAAAGTT AGACTGTATT GCCTACTGTC TATCTATAAA 1800 

ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA AAATACGATC GTAGTAAACC 1860 

ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACGGT AAAACTACCC TAACTGCAGC 1920 

TATCACAACT GTTTTGGCAC G 1941 
(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTACCCTGT TGTTAAAAAG ATAGGGAGTG 120 

ATAATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TGCTCTTGCT 240

GGTTGTTGAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 300 

ATTTAAATTT TCCGTATTAG TCTTGCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 360 

TGATTGCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 420 

TTAGGTTTGT TATGAGAATT GTTGATAAGA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCAGTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 540 

ATGGTTAGAG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG 600 

GAAGAAATAT TTTTAAAATA TCTTGGGACT TTAATATAAC ATTATCTGAA AAATTAAACT 660 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTGTCATTAT ATTTGCAACG 720 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 755 
(2) INFORMATION FOR SEQ ID NO: 230:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



CCAGAAAAAC CGTAGTGGAG CTCGTGGAAC AGTGGAATTG ATTTTCCAAA AAGAATACAA 60

TAAATTTTCA AGTATCTCAA AGAGGGAGGC ATAAGATGTC AGATGCATTT ACAGATGTAG 120

CCAAGATGAA AAAAATCAAA GAAGAAATCA AGGCACATGA GGGACAAGTC GTAGAAATGA 180

CTTTGGAGAA TGGTCGTAAG CGCCAAAAAA ATAGATTGGG TAAGCTAATT GAAGTTTATC 240

CATCTCTATT TATTGTGGAG TTTGGGGATG TGGAAGGAGA TAAACAAGTT AATGTTTACG 300

TTGAATCCTT TACTTACTCA GATATTCTTA CAGAAAAGAA TTTGATTCAT TATCTTCACT 360

AAAGTGAGAA ATTTTCTCAC TTTTTCTTTT TTCTCCGAAT AATTTAGGTG AAGGCAATCA 420

TCGCTTTATA TTATTTTTCA AGGAGGAAGA ATGAAAATTT TACCGTTTAT AGCAAGAGGA 480

ACAAGTTATT ACTTGAAGAT GTCAGTTAAA AAGCTTGTTC CrrTTTTAGT AGTAGGATTG 540

ATGCTAGCAG CTGGTGATAG TGTCTATGCC TATTCCAGAG GAAATGGATC GATTGCGCGT 600

GGGGATGATT ATCCTGCTTA TTATAAAAAT GGGAGCCAGG AGATTGATCA GTGGCGCATG 660

TATTCTCGTC AGTGTACTTC TTTTGTAGCC TTTCGTTTGA GTAATGTCAA TGGTTTTGAA 720

ATTCCGGCAG CTTATGGAAA TGCGAATGAA TGGGGACATC GTGCTCGTCG GGAAGGTTAT 780

CGTGTAGATA ATACACCGAC GATTGGTTCC ATTACTTGGT CTACTGCAGG AACTTATGGT 840
CATCTTGCCT GGGTGTCAAA TGTAATGGGA GATCAGATTG AGATTGAGGA ATATAACTAT 900
GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AGGATTTATT 960
CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 1020
GGAACTCATT ATTTTAAGAC CAAGTCTGCT ATTAAAACTG AACCTCTAGC TAGCGGAACT 1080
GTGATTGATT ACTATTATCC TGGGGAGAAG CTTCATTATG ATCAGATACT TGAAAAAGAC 1140
GGCTATAAGT GGTTGAGTTA TACTGCCTAT AATGGAAGCT ATCGTTATGT TCAATTGGAG 1200
GCTGTGAATA AAAATCCTCT AGGTAAtTCT Gl TO TllTf CAACAGGTGG AACTCATTAT 1260
TTTAAGACCA AGTCTCCTAT CAAAACTGAA CCCCTAGTTA CTGCAACTGT GATTGATTAC 1320
TATTATCCTG GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 1380
TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 1440
TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 1483
(2) INFORMATION FOR SEQ ID NO: 231:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CCCGGAAAAC AAGTTAAAGT TCAAGTTGCT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 60 

GAAGCTGGTC AAGAAGTTAC TTTTAACGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 120 

TCGCAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 180 

AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 240 

ACCGTCAACC ATATACAAAA GTTGTCATCA ACGCAATCAA CCCTTAATTT TAAGGAGAAC 300 

ACATGATACA GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 360 

CTGGACACGC CGAGAGTGGC GAATACCGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 420 

TTCCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 480 

TAAACGAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 540 

AGAGAGAAAT GACCCAGTTA TTCTTTCAAT CATTTTTCTT AGGTATGGCA AACTTATCGG 600 

AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GGAGGAAAAC 660 

ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 720 
TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAAGC AGCTGACGGA 780

CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 840 

GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 900 

TTTGAACGTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 960 

GGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 1020 

TTAAAAT 1027 

(2) INFORMATION FOR SEQ ID NO: 232:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:

CGGTTCAAAT GGTGCAGGTA AATCTACGTT AATTAATTCT ATTGTAGGTT TTCAAGAGAT 60 

TTATTTAGGA GAAATAGAGT ATTGTGATAA AGATTTGATA GTTAGTTCTC AACCTTTTGC 120 

TCATTTAGGC TTTACTCCTC AAACCACAGT AATTGATTTT TATACTACTG TGAAGGACAA 180 

TGTAATATTG GGGCTGAACC TTGCTGGAAA GTTTGGGAAA AATGCTGAGA AGTTGTGTCA 240 

AATAGCCTTA GAAATTGTTG GGTTAGCTGA TAAAAAAAAT AATTTGGTAG AAACATTGTC 300 

AGGTGGACAA CTGCAACGCG TCCAGATTGC TAGAGCAATA GCTCATAATC CAGATTTTTA 360 

TATTTTAGAT GAACCTACCG TTGGTTTAGA TACTGAATCT GCCGAAAAAT TTTTAATGTA 420 

TTTAAAAGAT AAGAGTTTGG AAGGAAAAAC TATTATCATA TCTTCACATG ACATAAATCT 480 

ACTCGAAAAG TTTTGTAAAA AAATACTTTT TTTACAAAAT GGCTCCATAT CATTTTTTGG 540 

TGATATGCGT GACTTTGTAG ATAATTCAAC TATCAAATTA AATTTTTCAA TGCAGAATAG 600 

AATTTCTAGA TATCAAATTG AATTTTTAGA AAATTTTAGA TTTAAAGTTC ACATCGAAGA 660 

TAATGATAGT TTTACAATAG AAGTCCCTAT AGAAGAAAAG ATCTTAGATG TTATCAATGA 720 

GGTAGGAAAA GCATGTGAAA TTAAAAACTT TTCAACAAGT AAATTAACCT TACAAGAAAG 780 

TTATTTGCAA AGAATAGGAG GAGAAAAATG AAGGCTGATC AATTAAGGCA CAAATCGGAC 840 

TTAGGTTTAA GAGGTCTAGC GATTATTGCT AAAAATGAGA TTATTGCTTT TTTTAGAAGT 900 

AAAGGTTTAA TTATTTCTCA GTTTCTACAA CCAATCTTAT ATGTTGTTTT TATAATAATA 960 

GGATTAAATT CTTCGATAAA GAACATTCAG TTTAATGATA TAAAAACCTC TTATGCAGAA 1020 

TATACAATCA TTGGTGTTAT AGCTTTATTG ATAATCGGGC AGATGACTCA AGTTATTTAT 1080 
AGGGTGACAA TAGATAAAAA ATATGGGCTA CTTGCTCTTA AGTTATGCAG TGGAGTTCGT 1140

CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TAGGGTTGAT AGTTCAAGAA 1200 

ATTATTATAT ATATAATTAC GTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 1260 

TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTTGGC ACTCCCTTGC AATTTTACTT 1320 

ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 1380 

CTTGGTTTTA CAGCTCCTGT TTTCTACTTA ATAGATTCTG CTCCTAGTAT TGTGAGATGG 1440 

ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 1500 

AATTCAACAA CTTTGGAATT ACTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 1560 

GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTTGGCTCAT 1620 

CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 1680 

AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 1740 

CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAAGTTTAT CACATACCAC 1800 

ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCGTCAAA TCATTTATGG 1860 

CAGATACAAA GCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 1920 

AATCCATGAA GCGAATGCTT GTTCAGACCA TATTCACATG CTTATCAGTA TTCCTCCGAA 1980 

ACTTAGTGTT 1990
(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 60 

ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCCCCATAT TTAGGTCCTG GACGTGGAAT 120 

CATTGAAAGC TCAACTGGAC GTATTCTTAT TCCGTCTTAC ACTGGTAAAG AGTCTGCGTT 180 

CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 240 

TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 300 

TACAAATAAT GGTAAAATTG CATATTTAAC AAGTAAAGAC GCAGGTACTA CTTGGAGTGC 360 

ACCGGAATAT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 420 
TTATAGCCAA TTGATTGATG GTAAAAAGGC TGTCATTTTA AGTACTCCAA ACTCCACAAA 480
TGGTCGTAAA CACGGACAAA TTTGGATTGG TCTAATTAAT GATGATAATA CAATTGATTG 540 
GCGTTATCAT CACGACGTTG ATTATAGTAA CTATGGATAC TCATATTCAA CATTGACAGA 600 
GTTACCAAAT CATGAAATTG GATTGATGTT TGAAAAATTT GATTCATGGT CTCGTAATGA 660 
ACTTCATATG AAAAATGTTG TACCATATAT AACATTTAAG ATTGAAGATC TGAAAAAGAA 720 
TTAAAGCTGA AATTTGAAAA TATATAAAAA GAGGATAAAA ATTATGGTAA ATTACGGTAT 780 
TGTTGGAGCT GGATATTTTG GAGCTGATTT AGCTCGCTCA ATGAACAAAA TTGAAGATGC 840 
AAAAGTGGTT GCGGTATTTG ACCCAAATCA TGGAGAAGAA GTTGCTCAAG AGTTGGGATC 900 
AGATGTTTGT GCAAGTTTAG ATGAACTTGT AGCACGTGAA GATATTGATT GTGTGATCGT 960 

AGCTTCACCT AGCTACCTTC ACCGTGAACC AGTTGTGAAA GCTGCTCAAC ATGGCAAACA 1020 

CGTATTTTGT GAAAAGCCAA TTGCATTGTC TTATGAAGAT TGTAAAGCCA TGGTTGACGC 1080 

ATGTAAAGAA AATAATGTCA TCTTTATGGC TGGTCACATC ATGAACTTCT TTAACGGTGT 1140 

ACACCATGCT AAAGAATTGA TTACTCAAGC TAAAATCGGT AAAGTTCTTT ATTGCCATGC 1200 

TGCTCGTACA GGTTGGGAAG AACAACAACC AACTGTATCA TGGAAGAAAC TTCGTTCTCA 1260 

ATCTGGAGGA CATTTGTACC ACCATATTCA TGAATTAGAT TGCATTCAGT TTATCATGGG 1320 

AGGACTTCCT GAAAAAGCGA CAATGGTAGG AGGCAATGTA TATCATAAAG GTGAAAACTT 1380 

TGGTGATGAA GATGATATGC TCATTGTAAA CTTAGAATAC TCTGATGATC GTTATGCTGT 1440 

TTTGGAATAT GGTAATGCTT TCCGTTGGGG TGAACACTAC GTCTTGATTC AAGGAACTGA 1500 

AGGAGCTATC AAACTTGACT TGTTCAATAC TGGCGGTACT CTTCGTGTTA AAGGTGAAGG 1560 

AGAATCACAC TTCTTAGTTC ATGAAACTCA AGAGGAAGAT GATGATCGTA CAGCTATCTA 1620 

TACCGGTCGT GGTATGGATG GAGCAATTGC GTACGGTAAA CCAGGAGTAC GTTGCCCATT 1680

ATGGTTGCAA ACATGTATTG ATAAAGAAAT GGAATATCTA CATGACATCA TTAAAGGTGC 1740 

AGAAATTACA GAAGAATTTG AAAAACTTCT CAATGGTGTA GCTGCTTTAG AATCAATCiiC 1800 

TACCGCTGAT GCATGTACTT TATCAGTTAA AGAAGATCGA AAAGTAAGTC TTTCAGAAAT 1860 

CACAAATGCT TAACTTTTGT AAAACAGAAT AGTAAATTCT TGTCATTATA TAATTTCTAA 1920 

AGTTCTGTGA TACAACTCAT TGAATAAAGA AATAGAGATG GGACTGGGAT AATGCCCACT 1980 

CCCATTTTTT ATCAAAAAGT AATGAGATCA AAAATGTGGG AGTGTTGAAA TGAAGATTAT 2040 

AGGTATCGAT ATTGGCGGAA CAACAATTAA GGCAGATTTA TACGATGAGT TTGGAACGAG 2100 

TTTGAATCAT TTCAAAGAGA TAGAAACAAT TATTGACTAT GATTTGGGAA CGAATCAGAT 2160 

ATTAAATCAG GTCTGTGATT TAATTGGTGA GTATACTTTA AATCATTCAA TTGATGGTGT 2220 
TGGGATTTCC ACTGCTGGAG TTGTTAATGC TAATACTGGA GAAATCATCT ATGCAGGCTA 2280

TACAATACCA GGGTATATCG GAGTAAACTT TACTGCCCAA ATAGAAAAAC GTTTTGGGTT 2340

GTATACTTTT GTTGAAAATG ATGTTAATTG TGCTGCATTA GGTGAATTGT GGAAGGGACA 2400

AGCCAAAGAT AAGAAAAATG TAGTAATGGT TACTATTGGA ACAGGTATAG GAGGCAGTAT 2460

TATTGTCAAC GGACAAATTG TTAACGGATT TAACTATACT GCTGGTGAAG TAGGTTATAT 2520

TCCTGTAGGT AATTCGGATT GGCAAAGTAA AGCCTCAACA ACCGCATTGA TTCATTTATA 2580

TCAAAAAAAG AGCTTGAAAA CTAATCAAAC TGGACGTACT TTCTTCACTG ATTTAAGATC 2640

TGGAGATAAA GTTGCTGAAG AAACTTTTGA AATTTTTGTA GAAAATCTAA CAAAAGGTTT 2700

ATTAACGATT TCTTATCTAC TTAATCCAGA AATTCTCATA TTAGGAGGTG GGATTCTGGA 2760

TAGTAAGGAT atttivvtac CTGAAATTCA AAGTTCTTTA GCTAAAAATG CAATGGATAA 2820

TAGGTTTTTA CCTAAAAATC TTGTGGCAGC TACATTAGGA AATGAAGCTG GTCGTATAGG 2880

AGCTGTAAAA AATTTCTTAG ATAGAATTTC TAATAAATAG TATGTAAGAT AAGGAGGTGT 2940

CACAATGACT AACTCTGTAT TTTCGACAAT GCAAGATATT GAGAATGTTG CAACCGATAT 3000

TATAAAATCA TATGATAATG AGATTTATAC TTATAAAGCT GTTTCCCAAG AAGAATTGGA 3060

AAAACTAGAA AAAAGTTATG ATGAAAAAAG TCACGAAGAA TTAGTTTCAA TAGAAAGCAA 3120

TTTAGAAATG AAACAACAGA ACCTTATTGA TGAGGTTAAT AAAACAATCA AGGAAAATGA 3180

TGCAAATATT CAGTATATTT CATCAAGTAG GAGAGGAGAA TTTGTAGAAA AAATTATTGG 3240

TAGGGTGGTA GAAAAATATG GCCATTAGTC AGATGAAAAG AATCTCTCTA CTATTTTCTA 3300

AAAGTAGTCT TGATGATGTT TTAAAAACTA TTCAAGAACT AGAGTCAGTG CAGTTCCGTG 3360

ATTTAAAGGT TCAGGATAAC TGGTCAGAAG CTCTACAAAA AGATGAAGTT GTATTTCCAA 3420

CTATTCAAAT TTTTCATACT TCTAATTCCA ATCATGGGGT TATTGAGGGA AATGATGCCT 3480

TGACTTATTT GATGAATCAA CAACAACATT TAGAAGCAAC TGTAGAGAAA TTACAAGAAT 3540

ACCTACCGAA AGAAAACACG TTTAAATTAT TGCAGCAACC TCCGATAACT ACCTCTTATG 3600

AAGAATTAGA GAAATTTGGT AAAGCTAATG TTGCTGAGGG TGTTCTTAAA AAAGTGAATC 3660

ATCAAATTAA CAGAGTTCAT GAATTAGAAA GACACATTCA AAGTAATAAT GAGGAAATAG 3720

AGCGATTAAT AAAGTGGGAA AAATTAGAAA TTGTTCCTGC GAATTTAGAA CAATTTTCTT 3780

TCTGTAAAGG AAAAGTCGGA ACAATTCCAA GGACTGAAGA TAATCGCTTA TACAATAGTC 3840

TTTTAGAAAA CAATATTGAA GTTCAAGAAA TATTTTCTAA TGATAGAGAG TACGGTGTTG 3900

TTGTTTTCTA TCAGTCTAGT TACTCTATAG ATTTTGATGA ATACTTATTT GAACCATTTG 3960

ATTATTCTAG AAAGGAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATCC 4020

AGTTAATAAC TGAAAAAGAG AATATTATCG CATCCTTGCA AGATTCAAAG AAATATTTGA 4080 

TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 4140 

ATAACTTTTT GTGCACTCCG CATCTAGTTG CATTAGAAGG ATGGATAGAA GAAACTCGTA 4200 

TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 4260 

CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 4320 

TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 4380 

CTACACCTGT ATTAGCACCA TTTTACTTTA CATTTTTTGG AATGATGGTT GCTGATTTAG 4440 

GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 4500 

CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 4560

GTGGAATCTA TGGCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 4620 

CTGATGTCAT GACTATATTA GTAGTGTCAG TTCTGTTTGG GTTTATTACA GTATTTGCAG 4680 

GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 4740 

CAGGATTTGC GTGGTGTGTT ATTCTG 4766 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 60 

CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 120 

AAAAACCATC AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGG CTGAGATGGA 180 

TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 240 

TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 300 

TCCAAATGAT GATGTATACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 360 

aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 420 

TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 480 

GGACTCTTCG CCTGGCTGCT TATTTGAGAG AACATAACTT TGACTTTGCG GCTCTCACTC 540 

GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 600 
TGGAAGTGGA TGAAAATGGT GCTGCTCGCG TTATCCTGAG TCAGAAAATC TTGAAACAAT 660

ACAATATAAC CGATGCTGAA ACTGCGGCCA TTGTAGGTGC ACCTGGACGC ATTGACAGAG 720

TGAGTCTCTG GGGAATTTTT GTCGAACAGG CTGATGGCCA CTACCGAGTT CCCTTACGCA 780

GTAAACTCCA TCCTATCAAT GAAATTGCCA AGGAGCATGA TGGTGGAGGC CACCCTCTAG 840

CAAGTGGTGC TAATTCCTAT AGCCTAGAAG AAAACGAAAT CATCTACCAA AAGTTACAAG 900

ACTTGCTTAA AAACTGATAA AATACTTGCC AAACTTTTCA GAATCTGATA GACTAGTATA 960

GTAACAATCT ATGGCTCGCA AAGAGACCAT GGCAGAAAGG AAATATTGCA AAATGAAAAr 1020

AGATATCCAT CCAGAATATC GCCCAGTTGT CTTCATGGAC ACAACTACTG GTTACCArTT 1080

CCTTAGCGGT TCAACAAAAC GCTCTAACGA AACAGTTGAG TTCGAAGGCG AAACTTACCC 1140

ATTGATCCGT GTGGAAATTT CATCAGACTC ACACCCATTC TACACTGGAC GTCAAAAGTT 1200

CACTCAAGCA GATGGACGCG TGGATCGTTT CAACAAAAAA TACGGTCTCA AATAATGATA 1260

AGAGAACAGT TTTGGCTGTT            TCTTGAAATC AACTGCTGTT TTCATGTTCC 1320

AGACTCATCT GTAGGTTCGA TTTCCATGCT ACTAGGCAGG AAGGAAATAG CTGTTTCAAC 1380

ACGTCCATAA TGAGCTATAC TATTGTCACG AACCACACTT TCATTGATGG TCCAACTGGA 1440

ATTCATTTTC TTAAAAGCTT CTCGGACTTT TTCCAAATCT TTGGAGGCAA TGGCCTGCTC 1500

TAAGGTTTCA AAACGAGGAC TTATACTCAT CTGCTTTCAA AAAGCATTCT AGTCCATCTC 1560

CGATTACCGA TGGACTTTAT CACCTCCTTC TCCAGTCCTT GTATGACATC TTGAAGTTGA 1620

TTCATGACAT CTTCCAAAGT TCgAAAGGCT TTATTCTTAA ATCCACGTTT ACGAATCTCT 1680

TTCCACACTT GTTCAATGGG TTCATCTCTG GTGTGTATGG AGGAATAAAG GTAAAATCAA 1740

TATTAGTCGG AATATTTAAG GTACTTGATT TATGCCATAT AGCATTGTCC ATAACGAGTA 1800

AAAGGATAAG CTTGTGAAAG CTCTTCTAAA AAGGCGTTCA TCCACACTCC TTTTTATAAA 1860

CCTGAAATAA GGCATCAATT GTAACAAATT CTCCTGCCTC TGTAGCCTTC AAATGACGGG 1920

CAAGAAAGGC TTTCTCTTCC TCAACTGTCA TATATGCATG GTTACGACCA CCACGTGTTT 1980

CTTGAAGGAG AGAGTCGAGT CCGAACTCCT CATATTTTTT TACGTTTCGC CAAATCGTTG 2040

TTTGATTACA GTCTAAAAGC TCTATAATCT CTTTATAAGA TTTGCCCATC AGACGAAATA 2100

TAGTAGATTG AAACTAGAAT AGTACACCTC TACTTCTAAA ACATTGTTAG AAATCGATTT 2160

GTCCTGTTCT TGTTTCATTT TACTATAGAA CGATTTGAAG GCGTTTATAA TATTTAGCTG 2220

TACGAGAGTC TTTTAAAAGT GTTTTGATGG TTTGGATTTC TTCTTTAGTT GATTTCATAT 2280

TACTATTATA TAATGCTTTT TGATTTTAGT CTGGTATAAA TATTGCTTTC CTCCAAAATG 2340

GTCATAGTTT TACTGGCAAA TCTAACATAT CACGCATAAA TTAACAAGTG ATTTCTGAAT 2400

TGCTAAACAT TTTCTTTTCT TATAGCATAC TTTAAGATTT TGTCTTTGAG AAAGATATTT 2460

CCAAGAAAAA CGTTCGTTTT TTGC 2484

(2) INFORMATION FOR SEQ ID NO: 235:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

CTAGATATAG CTATAATTTT ATTTATAACA AGAGGATAGA AATGACCGAA TTAGAAAGAA 60 

AAAATCGAAA AATTAGCTAA GAAATATTCT GATAACTTAA ACATCAAAGT TCAAGAGAGA 120 

GTTCGTGAAA TGGCAAATGA TAATAACAGC CATTATTTGA TATACAGAGT TTTAGGTATT 180 

TCATTTGAAG AAGGAGAAAA TATCGATTTG TATCAAAATA AAGGTCGTTT TTTATACAAA 240 

TATGCTGGTT CATTTTTAGA AGAAGCTGCA GTACTATGCT TTAACGAAAA ATTTGGTACA 300 

GAAAATACTT AAAAAGTTAA CATTCCTAAT TCTGAAAGTA CAAAACCTAA GACTTTTGAA 360 

ATTGATTGTT TAGTCGGAGA AAAACACGCA TACGAAATAA AATGGTGGGA TGCAACTACA 420 

GATGGAGACC ATATAACTAA AGAACACACT AGAATAAAAG TTATTCATAA CAAAGGATAT 480 

ATACCAATTC GGTTAATGTT CTACTATCCA AATAGAACTC AAGCTATAAA AATTCAGCAA 540 

ACTTTAGAAA CATTGTATAA CGGTATTGGA GGGAAATATT ATTATGGAGA TTCTGCCTGG 600 

GAACATTTAA GAGCAGTGAC CGGTATTGAT TTACTTAGTA TTCTAACAGA TATTGCAAAT 660 

AAAAAAACAG GGGTAAAATC AAAATGACAG TATTAAAAGG AGATAACTTA GAAATATTAA 720 

AAACTATTGA ATCCTCAAGT ATTGATTTAA TCTATATGGA CCCTCCTTTC TTTACACAGA 780 

AAACCCAAAA ATTATCTAAT AACAAAAATA TTATGTATTC ATTCGAAGAT ACGTGGAdT 840 

CGATTGAGGA TTACAAAGAA TTTTTGTCTG TAAGATTAGA AGAATGCAAA AGAGTGCTAA 900 

AAAATAGTGG CAGTATTTTC GTTCATTGTG ATAAAATTGC AAATCATCAT ATTAGATTAA 960 

TTTTAGATAA TATCTTTGGA GTAGATATGT TTCAAAGCGA AATTATATGG AACTATAAAC 1020 

GGTGGTCTAA TTCAAAAAAG GGATTATTGA ACAATCATCA AAACATTTAC TTTTATTCAA 1080 

AGTCAAAAGA TTTTAAATTT AATACAATTT TTACAGAGTA TTCTTCTACT ACAAATATCG 1140 

ACCAAATACT AGTGGAACGA AAACGAGATG GAAACTCTAA AACTATATAT AAGGTTGATA 1200 

ATAATGGTAA CTATATTCTA GCAAAAGAGA AAAATGGAGT TCCCCTTTCA GATGTTTGGA 1260 
ATATACCATT TCTTAATCCA AAAGCTAAAC AAAGAGTAGG TTATCCTACA CAAAAACCTA 1320

TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 1380 

CGTTCTGTGG AAGTGGAACT ACTTTAGTAG CCTCCAAGAT TTTGAATAGA AATTATATGG 1440 

GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGGAA AATGTTATAA 1500 

AAACAAGTTC AAATTTATTG AATAAAGGAA TCGAAGCATA TAGAACCAAA ACTGAGGAAG 1560 

AGGAAAACAT TCTTAAATTA TTACAGGCAA AAATTGTTCA AAGAAATAAA GGAATTGATG 1620 

GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 1680 

AATGTCTGAA TGAGAGTATC TCTTTATTAC AGAATGCTAT AAACTCCAAA AAACTTGATT 1740 

TTGGAGTAGT TATAAAAACT CATTCG 1766

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 748 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

CCGAAAATCA AATTCAAACC ACGTCAACGT CGCCTTGCCG TACTCAAGTA CAGCCTGCGG 60 

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AATAATATTA 120 

GCGCGGAGAA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTC 180 

TCATCTAATA CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 240 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTGG 300 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 360 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCACATCTT CTTGGGTCAT GATTCCCTTG 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 480 

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTGTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCOGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTGTTCATCA TGTAGTAGAC AAACTCTTCA 660 

TACTCCAACA CTTGCCCATC AAAGCGAATC TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTACAG 748 
(2) INFORMATION FOR SEQ ID NO: 237: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

AAAAGATTAC ATTGCAACAA TTGAAAATTA TCCAAAGGAA GGCATTACCT TCCGTGATAT 60 

TAGTCCTTTG ATGGCTGATG GAAATGCTTA TAGCTACGCT GTTCGTGAAA TCGTTCAGTA 120 

TGCTACTGAC AAGAAAGTCG ACATGATCGT GGGACCTGAA GCTCGTGGAT TTATCGTGGG 180 

TTGTCCAGTT GCCTTTGAGT TGGGAATTGG TTTTGCGCCT GTTCGTAAGC CAGGTAAATT 240 

GCCACGCGAA GTTATTTCTG CTGACTATGA AAAAGAGTAC GGTGTCGATA CCTTGACTAT 300 

GCACGCGGAT GCCATTAAGC CAGGTCAACG TGTTCTTATT GTAGATGACC TTTTGGCGAC 360 

AGGTGGAACT GTTAAGGCAA CTATCGAGAT GATTGAAAAA CTTGGTGGTG TTATGGCAGG 420 

TTGTGCCTTC CTTGTTGAAT TGGATGAATT GAACGGCCGT GAAAAAATTG GTGACTACGA 480

CTACAAAGTT CTTATGCATT ATTAATGAAA ACAGTCCCTA GGGCTGTTTT CTCTACACTA 540 

GGATATAAAA ATAGACTATA ACTAGTTAGA GAAAAACTAT AATTGAAAAC TATATCTTCT 600 

TGCAGTATAA TAAAAGGACT AAGTGTTTGA GATTTGTCTT CAAACATATG CAATTATTCC 660 

TGAAAGAGTA CAGTTAGGAG AGGGTTATGC CGATTCGAAT TGATAAAAAA TTGCCAGCTG 720 

TTGAGATTTT AOGGACAGAG AATATCTTTG TCATGGATGA TCAACGTGCT GCCCACCAAG 780 

ATATCCGTCC TTTGAAGATT TTAATTTTAA ATCTCATGCC ACAGAAAATG GTCACAGAGA 840

CCCAGTTGTT GCCCCACTTG GCTAATACAC CCCTACAACT GGATATTGAT TTTCTCTATA 900 

TGGAGAGCCA CCGTTCTAAA ACAACTCGTT CAGAGCACAT GGAGACCTTC TATAAAACTT 960 

TTCCTGAAGT CAAGGATGAG TATTTTGATG GGATGATCAT CACGGGTGCT CCAGTTGAGC 1020 

ATTTACCATT TGAGGAAGTG GACTATTGGG AGGAATTTAG ACAGATGCTT GAGTGGTCTA 1080 

AGACTCATGT CTATTCGACC CTTCATATCT GTTGGGGGGC TCAGGCTGGG CTTTATCTGC 1140 

GCTATGGTGT AGAAAAATAC CAGATGGACA GTAAGCTATC AGGTATTTAT CCTCAGGACA 1200 

CCCTAAAAGA GGGTCACCTT CTATTTAGAG GCTTTGATGA TAGCTATGTA TCCCCTCATT 1260 

CACGGCACAC GGAGATTTCT AAGGAAGAGG TCTTAAACAA GACCAATCTC GAGATTTTAT 1320 

CAGAAGGACC TCAGGTTGGG GTTTCTATTw TGGCCAGTCG TGATTTACGA GAAATTTATA 1380 

GTTTTGGTCA TTTGGAGTAT GACCGTGATA CTTTGGCAAA AGAGTATTTT CGAGATCGTG 1440 

ATGCAGGTT 1449 
(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 904 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TACCCGCTTC TTTCAAGAGT TGGAGCAGGG CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTACT CCTAATGTAT AGTCTACCAT 120 

ACTATGGATA ATTGTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 180 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGGCATGAAC 240 

AACAGCCAAA ACAAAGTTGA AAATCCAGGA AGATTTTGGT TTATCTAGGG TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 360 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 420 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGGCTCC CCAAAGTATA 480 

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 540 

CGTTAAAAGG GCTGTCATTG TCATAAATTG TGTCTTTTTC CGTGTATTCA CAAGAATCTC 600 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA 660

CTTGGGTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 720 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTGT 780 

TCTTATTTCG TTTTACTATA GTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 840 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 

GGGA 904

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
CACTCAAACA TGACTTATAT CAAGACGGAT GGACTTCAAG ACGATGCCAA TCGCTTGAAT 60
CGTAACATTC ACTTTGCTGT TCGTGAATTT GCAATGGCAA CAATCTTGAA CGGGATGGCC 120
CTTCATGGTG GACtTCGTGT ATACGGTGGA ACTTTCTTCG TCTTCTCTGA CTATGTGAAG 180 
GCAGCTGTCC GCTTGTCAGC CTTACAAGGA CTTCCTGTGA CTTATGTCTT TACCCATGAT 240 
TCAATCGCAG TTGGGGAAGA TGGTCCGACT CATGAACCAG TTGAGCATTT AGCAGGTCTT 300 
CGTGCTATGC CAAATCTAAA TGTTTTCCGT CCAGCAGATG CGCGTGAAAC GCAAGCAGCT 360 
TGGTACCTTG CAGTGACAAG TGAGAAAACA CCAACTGCCC TTGTCTTGAC ACGTCAAAAT 420 

TTGACTGTTG AAGATGGAAC AGACTTCGAC AACGTTGCTA AAGGTGCTTA TGTTGTATAT 480 

GAAAATGCAG CCGACTTTGA TACCATCTTG ATTGCGACAG GTTCAGAGGT TAATCTTGCT 540 

GTCTCAGCTG CTAAAGAATT GGCTAGTCAA GGCGAAAAAA TCCGCGTAGT CAGCATGCCA 600 

TCTACAGATG TCTTTGATAA ACAAGATGCA GCTTACAAGG AAGAAATTCT TCCAAATGCA 660 

GTCCGCCGTC GTGTTGCAGT CGAAATGGGT GCAAGTCAAA ACTGGTACAA ATATGTTGGT 720 

CTCGATGGTG CCGTTCTAGG TATTGATACT TCGGAGCCTC TGCCCCAGCA CCAAAAGTAT 780 

TGGCAGAATA TGGCTTTACT GTAGAAAATC TTGTAAAAGT TGTTCGAAAC TTGAAATAAT 840 

CCTAAAAATC AGGGCGTAAG CTCTGGTTTT TCTTACCAGA AAAGTAAGGT ACAATCTTGT 900 

AAAAGTAGCT GAAATTTGAT ATAGTAGTCC TATGTAAAAG ACAAAG 946 
(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:
CGGGGCTCCc TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTGTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAACAC 120 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTACG GCAAGGCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGTCGTCC 240 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCTCTTG AAAAATCGTG 420 

GTCTCAAAGT AACCATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATGA 480 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACGG AGCTGAGACA GATTTGGACT 540 
TGGGTCACTA TGAACGTTTC ATCGATATCA ATCTCAACAA ATATTCCAAC GTGACAACTG 600

GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCGTGG AGAATACCTT GGGGCAACTG 660 

TTCAAGTCAT TCCTCATATC ACAGATGCTT TGAAAGAAAA AATCAAGCGT GCCGCTCTAA 720 

CGACCGACTC TGATGTCATT ATCACAGAGG TTGGTGGAAC AGTAGGAGAT ATCGAGTCCT 780 

TGCCATTCCT AGAGGCTCTT CCTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 840 

ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 900 

CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 960 

GTACAGAAGA GCCAGCTGGT CAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 1020 

CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TTACCAAATT CCACTGAACT 1080 

TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 1140 

CGGATATGAC AGAATGGTCA GCCATGGTGG ACAAGGTCAT GAACCTCAAG AAACAAGTTA 1200 

AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 1260 

CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTGG GTCAATGCCA 1320 

ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TGCGGACGGG ATCATCGTAC 1380 

CAGGTGGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCGC TATGCGCGTG 1440 

AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATGCA GTTGACATGT ATCGAGTTTG 1500 

CTCGTCACGT TTTAGGTCTT GAAGGTGCCA ATTCTGCAGA CCTTGCACCA GAAACAAAAT 1560 

ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 1620 

GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA CGCTGCTGCT GCTTATCACA 1680 

ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 1740 

AGCAGTTTGA GGCAGCACCT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAG 1800 

AAATCGTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 1860 

CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 1920 

ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 1980 

ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 2040 

AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 2100 

AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 2160 

TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 2220 

AGAAATATTT TAGCAGTCAA GATGTACTGT TATAGATTCA ATACATTATA CTTTTTTAAT 2280 
TTAATCCACT ATAGTAAAAT GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 2340

ATTTCTAACA ATGTTTTAGA AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 2400

ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT 2460

TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCCCT 2520

ACATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 2580

TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG 2640

ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 2700

GAGACTAGAC AATTTCAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG 2760

AAGA 2764

(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1682 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:

CCGTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 60 

CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 120 

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180 

TTTTTCATCA AGTGCTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 240 

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 300 

AACTCTTACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360 

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 420 

AATGAAGATC ATAGACTTCT ACTAAGCGTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480 

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT AGTATCTAAT AACTTATCAT 540 

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGGTAGCCT TCCTTGTACC 600 

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660 

TTTTAGCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AATATCCATA CCTCTGAGTG 720 

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 780 

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840 

GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 900 
CTTCTTGAAT ACTCTCTTGC CCCCAGATAA TCTCTGTCTT TTCACCATAT ATCTCACTGG 960

TTTTAGCTGT ATTTGTATTA ACAGCTACTG TCACAACTTC TGGGAAATCT TTAACCAACT 1020 

CTTTTACCAA TTGAGTTAAA TTAAGCTGGC GGTTTGTAAC AATAATAATC TGAACCTGTC 1080

CGGTCTTTCT CGCGCGTCGG ACCATAATAG TACGGACACC TAGAACTTTT CTCTCATCCG 1140 

TGATTGGAAT CTGGTGATAA GTAAGTAATT CTGCTAAGCG ATTAGCAATC ACTTGGGTTT 1200

CCTTATCTTG TACCAGGCAG TCTTTCAACT CTACTAAATA GTGAGAGTTT TGTGCATATA 1260 

AGCCCGCCTT GACCTGATTT TTAAATTTTC GAGTCTGAAA TTGTAACTTA GCTCTGTAAT 1320 

ATTTTGGTTC CTGCATTCCA ATAGTTGGAC GAATTTCATA ATTTTCATAT CCTGCAGGAG 1380 

CAAATTTTTT CAGCGCTTGA TGAAGTAAGT CCCTCTTGAA CTCCAGCTGC TTATCATAAT 1440 

GCAGGTGCAT GATTTGGCAG CCTCCGCATT CATTATAAAT AGTACAAGAT GGCACAATTC 1500 

GAAATTTAGA CTTCTTGTTG ACCTTCAGTA ATTTTGCTTC AACAAAGTTG CGTCTAATAG 1560 

AAGTAATCTG ACAATAGATA TCTTCGCCTT TGAGAGCTCC TGGTACAAAG ACTAATGTTT 1620 

TTTGGTAAAA CCCGATTCCC TCACCGTTAA TTCCCATGCG CTTGATTTTT AATGGTATTT 1680 

TT 1682 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TTAACTTTGG TCAATTCTTT AAAGTCATCC TCTGTAAGCA TGTCTAACCA TTGATGTTTC 60 

CCTTTATTGC TAAAATCACC AATTCCGACT ACAGCTATAT CTAAATCTTT CCAACTATTT 120 

TTCAAATTTT CAAAATATCT TGATTGCAAA ATACCATCTG CTAACAATTT ATTTTCTTGC 180 

ACAATCGTTG CATTCATAAA TGTACACTCT CCATGAAATT TTCTAGACAT TTCATAAATC 240 

AGTGTATTCA CATGGTATTT AGCGTGTATG TGACTAGGAC CACCTGCTAG AGGATAGAAG 300 

TGAACATTTC GGACACTTTT ACTGTGAATT AAATCTACTA AATTACTTAA ACTTTTCCCC 360 

CAAGAAAAGC CAATTTTCAT ATTATCATCA ATTAGATTCC TAAGGACGCC TGCTGCAACT 420 

TGAGAAATTC TTTCAGATAA AATTGTTGGA GTATCATCAA ATTCATTTGG AATAATTTCT 480 

AAACTTTCCA AACTGTATTT TTCTTTTACA TAATTTTCCA ACTTAAACAT ATTGGTATCA 540 
AAATTCTCTA TTTCAATTTT AACAATTCCT ACATTCCTTG CTTCTGTTAA CATTCTACTA 600

ATAGAGGTTC TATAAATTCC TAATTTTGCT GCTATTTGTG ACTGATTTAA GTTTTCAATA 660

TAATACAGAT AAGCAATTTT AGAAAGCAGT TTATTCCTAT CTTGATTCAT ACACTTAACC 720

TCTTACGAAA CTACCTTAAC CATTATCCCA GCATTTTCTA ATGTAGCTAT ATTTTUTTTA 780

GAAAGTTTTT CGTCTGTTAT TACTTCATAG ACTTGACTTA AAGCAAATCT TCTTACTGTA 840

CCTCTTTTAT CAAATTTACT TGAGTCAGTT AGGACAATGA CTTTATCCGA CACTGCTGAA 900

ATATATTGAA CTACCTCACT GCGCATTAAA TCTTTTCCGG TAAAGCCCAT CTCTTTATCG 960

TAACCATCTG TCCCAACAAA AGCTTGACAC ACATGAAAAG TCTGTATCAT TTCTTTTAAT 1020

AAAGGTCCTA CACTCACCTG TGAATCTTTC TGAAACTCAC CACCAAGAAC AATAACACGA 1080

CATGAATCAT AAGCTCTCAC AAAATTTGCT ATAAAAAACG AATTTGTTAC AATCGTAACA 1140

TTTCTTTTTT GCTTGCAAAT TTCCTCAGCA AGTAAAGCAC AGGTCGATCC AGATTCTATC 1200

ATTATTGTTT CATTATCTGA CACCAATTTT ACTGCTTCCT GAACAATTTT TCTCTTACIT 1260

TCATAATTAA TTGACAAACG TACATTTAAG TCATCTCCAC TATTTAATAC AGCATATCCA 1320

TGCTCTCTGT GTAATAAACC TTTTGACTCT AATTTATCTA AATCTTTTCT AATCGTTACT 1380

TTCGATACAT TTAATTTTTC CGATAATGTA TTAACGTCGA TCTTTTCATA TTCTGATACT 1440

AATTTAATAA TTTGTTCCAA TCTTTTCATT TTACACCTCC GTTTTATTCT ACCAAAATAA 1500

AAAGCAAAAA ACAACAAATT AACCTTTCGT TCGTAATTGT TTTTCTTTCG TTTTTGTGAT 1560

AGGATAGACT TATGAAGAGG AGGAACTCTT ATGGAAATAT CTAAAGGAAT TATTTTTAAT 1620

ATTCAACACT TTTCAATTCA TGACGGTCCG GGTATTCGTA CAACTGTTTT TTTAAAAGGA 1680

TGTCCTCTGC GCTGTCCATG GTGTTCTAAT CCTGAATCTC AAAGAATGAA ACCTGAAAAA 1740

ATGAAAGATG CTCAACGAGA GAAATTCACC TTAGTCGGTG AAGAAAAGAC TGTAGAAGAA 1800

ATTATTACAG AGGTATTAAA AGACAAAGAA TTTTACGAAG AATCCGGTGG AGGTTTAACT 1860

TTATCAGGAG GTGAAATATT TGCTCAGTTT GAATTTGCTA AAGCCATCTT AAAATCAGCT 1920

AAAGAACATC ACATACACAC TGCCATTGAA ACTACTGCCT TTGTTGATCA TGAAAAATTT 1980

ATTGATTTAA TTCAATATGT GGATTTTATC TACACAGACC TAAAACATTA TAATTCTATA 2040

AAACATAAAA AAGTGACTGG GGTTTTTAAT CAAATGATTA TTAAAAACAT TCATTATGCT 2100

TTTTCACAAA ATAAAACTAT CGTTTTAAGA ATCCCAGTTA TTCCTAATTT TAACAATAGT 2160

TTAGAGGATG CAGAAAAATT CGCTACTCTA TTTAACTCAT TAAATATCGA CCAAGTTCAA 2220

CTACTCCCTT TTCATCAATT TGGTGAAAAC AAATATCGTT TATTAAATCG GAAATATGAA 2280

ATGGATGGAA TCAACGCACT TCATCCwGAA GATCTTATTG ATTATCAAAA GGTATTTCTG 2340

AACCACCATA TTAATTGTTA TTTCTAGTTT ATTTCCTTGA AATGCTCTAG CTATTTGCAG 2400

ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 2460

CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT ATCACAAAAA 2520

AACG 2524

(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2359 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCAGACAGG AAAGGGGATG AAAATTGTGA 60 

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGCTGACTTG AATGATGTTC 120 

GGATGATTCA CTCAAGTAGT GGTATTCACT CCTTTGAACC TTCGGCAAAT GATATCGCAG 180 

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 240 

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGCTCTT AGAGGCTTCT GAGGGAATGA 300 

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAGG GGATGGAGTT GATGAAAAAA 360 

CCCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 420 

TCGCTGATAA ACTTTCAGAG GTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 480 

AAGCCTTTAT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540 

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600 

GGCTTAATCA ACTTGGTATT GCAGGTATCT CTCCTGAACA AGAACCAAGT CCACGACAAC 660 

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 720 

ACGCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAACTC 780 

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840 

ATATGAGTAT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 900 

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGACGTTAC 960 

CAAGCTGGTC AGGATAAGAA ACAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGCT 1020 

GGTCAAAAGG CAGAAAACTT GACACCAGAT GAAGTCAGTA AGAGGGAGGG GATCAACGCC 1080 

GAACAAATTG TTATCAAGAT TACGGATCAA GGTTATGTGA CCTCTCATGG AGACCATTAT 1140 
CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTGAAGAGCT CCTCATGAAA 1200

GATCCGAATT ATCAGTTGAA GGATTCAGAC ATTGTCAATG AAATCAAGGG TGGTTATGTC 1260 

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 1320 

CGGACAAAAG AACAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 1380 

GCAGATAATG CTGTTGCTGC AGCCAGAGCC CAAGGACGTT ATACAACGGA TGATGGGTAT 1440 

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGCC 1500 

GACCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA CCGAGTTAGC TGCTGCAGAA 1560 

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 1620 

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680 

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTGT ATGCTAAACC CTTATCAGAA 1740 

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800 

ACAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860 

GAATTGGAAA AACGAATTGC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 1920 

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGAACCTAG TCCAAGTCCG 1980 

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 2040 

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT ATATCCCAGC CAAGGATCTT 2100 

TCAGCAGAAA CAGCAGCAGG CATTGATAGC AAACTGGCCA AGCAGGAAAG TTTATCTCAT 2160 

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220 

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2280 

TTTGAGGCTT TGGATAACCT GTTGGAACGA CTCAAOGATG TCTCAAGTGA TAAAGTCAAG 2340 

TTAGTGGAAG ATATTCTTG 2359 
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TTCTTTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AGTTGTTAAA 60 

TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 120 

AGCGATTCAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180 
ACTAAGGTTA CGCATCCCCT GATTAAATAC TACCGCGACT ATGTTGCGGG TAACGATATT 240

AGCAAGAACC AAAGCTGGAT TATCGACCCC TATGCTGCTA TGGTTAGTGA ATTGGGACTA 300 

AAACCAGCTG TGCTAGAAAA AATTTCTAAA AGCATTCACG CCTTGGCCAC TCTTCCTATT 360 

GAAGAAAATG AATTTCTCTA CGATACTTTC CTAGCAGCCG CTGAAAATAA CAATGCCAAA 420 

TTGATTGCTG CCTACTTTAA CCAAAATGGT ATCGATGCAC GCTATATGCA CCCTAGAGAA 480 

GCTGGGATTG TGGTCACAAG TGAACCTGGT CACGCTCGCA TCATTCCATC AAGTTATGAC 540 

AAGATTGAAG AATTGACAAA CACCAATGAA GTCCTTGTCA TTCCTGGTTT CTTTGGTGTC 600 

ACTAAGGAAA ATCAAATCTG TACTTTCTCA CGTGGAGGTT CTGATATTAC AGGTTCTATC 660 

ATTGCTGCTG GTGTCAAAGC TGACCTCTAT GAAAACTTTA CGGACGTTGA TGGTATCTTT 720 

GCAGCCCACC CTGGTATTAT CCACCAACCA CACTCGATTC CTGAGTTGAC CTACCGTGAA 780 

ATGCGCGAGT TGGCCTATGC AGGCTTCTCA GTCCTTCATG ACGAGGCTCT TCTTCCTCCC 840 

TACCGTGGAA AAATTCCTCT GGTTATCAAG AATACCAACA ACCCTGACCA TCCAGGTACT 900 
CGTATCGTTC TAAAACACAG TAATGATGAA TTTCCAGTTG TGGGAATTGC TGGTGACTCA 960

GGCTTTGTCA GCATTAACAT GTCGAAATAC CTCATGAACC GTGAGGTTGG ATTTGGCCGC 1020

AAGGTTCTGC AAATCCTGGA AGAACTTAAC AT 1052

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 855 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 60 

AATCGTGAAG AGATTCTGGA AGGTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 120 

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAAAGAGGA AGAGGAAATC 180 

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 240 

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300 

AACTTAAAGC AAGTATTCTC CAAAGAGCGA TGGAAGGGAA ATTAGTGCCG CAAAATCCCA 360 

ATGACGAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA AAACTTATCA 420 

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GATGATGGCA 480 
AACATTATGG GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 540

CTGATACTTC GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGCG CAAAATAAGC 600 

CAGAGAAATC CTTTAGGTAT ATAGATACGT CTAGTATTGA TAGAAAAAAG AACATAATCA 660 

ACTACAAAAA TCTACAATAT CTTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 720 

TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 780 

TTAGAGAACT TAAAGAGTAT TTGATAGCTA GTACAGCATT TAATGTTTTG GGATACTTTA 840 

CTTAACGAAA CATAT 855 



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 



TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGGA ACATTTAGAT 60

ATTATCAAAA GAGAGATGTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120

GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180

CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240

ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 300

GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 360

AGGGATTGAG TTTAGATGAA TTGCAGAAAA AGACAGAAAT CCAGTTAGAT ATGTTGGAAG 420

CAATGGAAGC AGACGATTTC GATCAACTTC CAAGTCCTTT TTACACGCGT TCTTTCTTGA 480

AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATGCT TATGATTCTG 540

GGAGTATGAT TACTTATGAG GAAGTAGATG TTGATGAAGA TGAGTTGACA ggtcgtaga: 600

GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 660



(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
CCGGTTGCAC AGGATCGTGC ATAGTCAACT CTTCAAGTAT AGCATATCTC CTATTTTCTT 60

ACAAGTAATA ACACCTAAAA TGAAGCTTTT TCTTTTACTT TTTTCTGCCA AGAGGCAAAA 120 

AGCATGCTCA GGTAAAAAAC GCTCATCATA ATAGGAACAC CAAGAATCGT CTTTTCATGA 180 

TAGAAAATCG TCAAATAGGC TGAAAAGACA ACGCCAAGGA CAAAACTACT AAGCAGGCTA 240 

ACAAATATGA ATCCTTCACG CAAAAAAGGA GTGTGCTTGG TTCGGAAATA ATCTCCAAAA 300 

GCCAGCATGG TCCGTTTGAT ATTCCCTGTC ATAAAAGCGT TATTATAGGC AATACCCGAC 360 

ACTTCTCCAA AAGCAGTTGT CACCAGTCCC ATACAGAAGG CCAAGGCCGG CACTAGATAG 420 

ATATTATCCA CAGTTTGCGG CACAAAAGCA ATAATGATTG ATAAGATTGC CAAGGGAATC 480 

AAGGACAGAA TAGGTTTTTT CACAATTCTC AATTTTTCCT TATAAATCGT TAATAAAAAG 540 

ACTCCCATCA TAAACGCTAG CAAGGTGAGA ACCTTGTCCC TAACATCCGA AACATTATTT 600 

TTAATTAATT CTACTGAAAG AAAGACAACA TTTCCAGTTT GTCCAGCTAC AAGGGTATTC 660 

CCGCGAACAA TAAAAGTGTA AGCATCCACA TATCCAGCAC AAAACGTCAA AAAAAGTGCT 720 

AACCTTTTAG ACTGACGTGA TATTTTTCTT ATAGGTAATA ACCTCATTTT ACCTCCCATT 780 

GTATTTTCTC TTAGAAATAT TGTACCATTT TCTTTCTAAA AAATCGTAGG CTACCATTTA 840 

GATTTTACTA TTAGCATAAA AATAATAATA GACAACTATT TATCCAAAAA TAGATAGATG 900 

TAACATGTTT GCAAACAAAG CATACGAACC TTTAGTAAAA TCATTTCCAT GAAACTAGAA 960 

TAGAGCCCTC TTAGCAAAAA TCATTATTTT AATTTATTTC TAATCACTCC TTGACATAAA 1020 

TAACTCTCAC CAATAAAAGA CTATGTCTTA AAAAAATGGT ATAATAAAAT CAATACTTGG 1080 

GCTTGATGGC TATGCTACTA ATAACAATTA GGAGAGAAAA TCAGGCACTT CTTAACAACA 1140 

AGGATTATCC CCTTGAGATG AAAGGAACTT TAGAAATCTT ATGATGAACA TGCAAAACAT 1200 

GATGCGTCAA GCACAAAAAC TTCAAAAACA AATGGAACAA AGCCAAGCTG AACTTGCTGC 1260 

TATGCAATTT GTTGGCAAAT CTGCTCAAGA TCTTGTCCAA GCGACCTTAA CTGGCGATAA 1320 

GAAAGTTGTC AGCATTGATT TCAATCCAGC TGTCGTTGAC CCAGAGGACC TTGAGACTCT 1380 

TTCTGATATG ACCGTTCAAG CCATCAACTC TGCTCTTGAA CAAATCGATG AAACTACCAA 1440 

GAAAAAACTG GGTGCTTTCG CTGGGAAATT ACCTTTCTAA AAACAAGGAG CTAGAACAAT 1500 

GCTTGTCGAT AACAAAGGCT AAGAAAGGTG CAAAAATGAC TCTATAATAT TTGTAGTGGG 1560 

TAAATCCCCT ATGGATATTA TGGAGCCTAT TTTTGTGTAG AAAAAAGTCC CATATGACCT 1620 

ATAATGAAAA GCGACAAAAC AACTCATTAG AAAGAATCAT ATGGAACAAT TACATTTTAT 1680 

CACAAAATTA CTAGACATTA AAGACCCTAA TATCCAGATT TTAGACATCC TCAATAAGGA 1740 
TACACACAAG GwAATCATCG CCAAACTGGr CTATGAAGCT CCATCTTGTC CTGAGTGCGG 1800

AAGTC 1805

(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2516 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

CTGCATCTAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA GATTGGAGAT TATGATTTAA 60
CGTCGCCGCG TTGGGGTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120
GAATTAATAA TAATGACAGC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180
CCTATAGTGG GTCAGATATT ACCCCGGTAT ATTCATTGGG GTCTGGCTCT AGGATTGTCA 240
TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACTAGATTA GCTATGGCGC 300
CACAATTTGG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA CATTCTTATT 360
AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 420
ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 480
TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTGACCC TTCTTTGCAT 540
GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600
TTTTGCCGCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660
TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 720
ATTGTGTCGA ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTC 780
TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 840
TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900
CAATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTTGGG GCGCTGTGTT 960
CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 1020
CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080
AGAACCTGTG ACAACGCTAG CTTTGCGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 1140
TGCTCATCTT GCTTTTATTT TATTTGTTAG TTGTTTGGTT GTTTATGGTA CATTTTTTCT 1200
ACATTCAGAG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 1260
GCAGTATTGG TTTATTTTAT ATTACTGCTA TCCTGTGTTT TATCTAGTAC AGTATATTTA 1320
TTGCGCtGTC GCCAATATTC AATCCATCCA AATCTATTAG AATGGATCTT AGTTTTACTT 1380

CAAGATATGA CGACTGGAGT ATATTGCTTT CCGTTCACAT ATATATTGTT CTTTTTTTAT 1440 

TTGATGAATA ACTATTTTAA TAGGTTGGAC TGTCGCATTC GTCTGAAATC AATTAAGCAC 1500

TTTACCAGTT TTAGTTTCAA ATTAGCAGCT CTTAGTACGG GGATTTGGAC GGCGACTTTA 1560 

TTTTTATTGA TTTTTCTAAT TGCATTTAGT AATGGTTTTA GCTTCTCTTT GGAGATAAAG 1620 

GAGGTTGATT TTTTAAGAGA ATTTTATGGT ATAAGTATTG CAAACAATGC TAGTTTCTTT 1680 

ATAGGATTTT TTTTCTCTTA TATAGCATAC TATTTCTTTT TATCCTTACT TACTATTAGC 1740 

AGTTTTTCTT GGTTTAAAAA ATCAAACATG AGCTTACTAT TTCTGTTTAC TTTTTTATTT 1800 

GTAGAATCCT TATTCTGGAT TTATCAGTTG GACAATGGGA TAATTGGATT ATTGCCAATT 1860 

TTTCAGTATA TGGTAAATTC CAATCCGTAT GCATTGATTT ATTGGCTTAC ATTACTATCT 1920 

ATCATAATTC CATTGACTGT ATTTTCTGTT CATAGAAACT GGAGGAGAGT GTAAAAGTTG 1980 

GAAATGGGAA AGTTAAGTAG TCACATGTGC AGGTTGAATC AGATAATCTA TACCAAGTAC 2040 

TTTTGGGGTT ATGTTCTTTT TTGGATATTG ATTTCTTTAG GATTATGGTA TTGGTTAGAA 2100 

GGAAATGATA GACTTGTTAT AGAAATTTTA AAAGGGCCTA ATCTGAGTCA AAACTCTTTT 2160 

TTAGTCTTAT CTATATGGTT GCTTCATTGG TTTATTATTC ATACATTTTT TCTAGCAGTT 2220 

GTATATCGTA GAAGAGCATC CGATTTCTTT ATGGAAGTGA TTCGATTTTC TTCTATTAAG 2280 

CTCTGGATTA GGTATCAGAT TTGGACCTGT TTTCTTTATG GACTCATTTT AATCATGGTA 2340

AAAGTTCTAG TGATTCAATT TATGTTACAG TTACCAAACT GGGATATAGG ACTTTTGTTT 2400 

ATAGTTGATT CTTTGAATGC TTGTGTGTTA GTCTTGTTTT GCTTTATGTT ATACGCACTA 2460

GGAGCGAATG TACAAATGAA CTTTGCTTGC GTTAGTTTCT TTTTACTCAT GATTGG 2516 
(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CGGTGTTTTT TTGTAAATTT TCTAGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAAGAA ATACCAACAA TTATTTAAGC 120 

AAATCCAAGA AACCATTCAA AACGAGACTT ACGCTGTCGG AGATTTCCTT CCTAGCGAGC 180 
ACGACCTTAT GGAGCAATAT CAAGTGAGTC CTGATACCGT CCGAAAGcCC TGTCTCTCCT 240
CCAAGAGGAA GGATTGATCA AAAAGATAAG AGGGCAAGCT TCTCAAGTCG TCAAAGAAGA 300
AACCGTCAAT TTCCCTGTAT CCAACCTAAC CAGCTACCAA GAACTAGTTA AAGAACTTGG 360
ACTGCGCTCT AAAACCAACG TGGTCAGTCT GGACAAGATT ATTATTGATA AAAAATCCTC 420
ACTGATAACC GGTTTCCCAG AGTTTCGGAT GGTTTGGAAG GTGGTCCGCC AGCGTGTGGT 480
GGATGATCTG GTATCCGTTC TGGATACGGA CTATCTGGAT ATGGAACTCA TCCCAAATCT 540
CACTCGCCAA ATTGCTGAGC AGTCTATCTA TTCTTATATA GAAAATGGCC TCAAACTCCT 600
TATTGATTAT GCTCAGAAGG AAATCACCAT TGACCACTCA AGCGACCGAG ACAAGATTCT 660
CATGGACATT GCCAAAGACC CTTATCTCGT TTCGATTAAA TCAAAAGTCT ATCTCCAAGA 720
CGGACGCCAA TTTCAGTTTA CCGAAAGTCG CCATAAGTTA GAGAAATTTA GATTTGTAGA 780
TTTTGCAAAA CGCAAGAAAT AAAAGACTGA GACACCAGAT CTCAGCCTTT TTCGGCTCTA 840
TAATATTTGT AGTGGGTAAC CCCCCTATCG ATATTATGGA GCCTATTTTG TGTAGAAAAA 900
AAGTCCCATA TGACCTATAA TGAAAAGCGA CAAAACAACT CATTAGAAAG ATTCATATGG 960
AACAATTACA TTTTATCACA AAACTGCTCG ATATTAAAGA CCCAAACATC AAGATTCTAG 1020
ACATCATCAA TATGGATACC CACAAAGAAA TTATCGCTAA GCTGGATTAT GAGCCTCCAT 1080
CTTGCCCTGA TTGTGGAAGT CTAATGAAGA AATATGACTT TCAAAAACCG TCTAAGATCC 1140
CTTACCTCGA AACAACTGGT ATGCCTACTA GAATTCTCCT TAGAAAGCGT CGTTTCAAGT 1200
GCTATCATTG TTCTAAAATG ATGGTCGCTG AAACTTCTAT CGTCAAGAAG AATCATCAAA 1260
TTCCTCGTAT TATCAACCAA AAAATTGCGC AAAAGTTGAT TGAGAAGATT TCTATGACCG 1320
ATATTGCTCA TCAGCTGGCC ATTTCAACTT CAACTGTCAT TCGG 1364
(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1227 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 60
GACAATCGGT GGAGGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 120
CATCGGAGAA GTGCAAACAA GTGTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 180
TTTGTAGAGA ATCGAACCGC AAGGTTCGGT TTTCTTTCTC TTTTTGTCTA TAATTTGGTA 240

TAATAAACAG TATGAAAATC GTATCAGGAA TCTATGGGGG ACGTCCCCTC AAGACACTAG 300

AAGGCAACAC GACAAGACCT ACTTCGGATA AGGTTAGGGG AGCCATTTTT AACATGATTG 360 

GTCCCTACTT TGAAGTGGGA CGAGTCTTGG ACCTTTATGC AGGTAGTGGT GGTTTATCTA 420 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTCTTTTGGT GGAGCGAGAC CGTAAGcTCA 480 

GACCATCGTG GCTGAAAATA TCCAGATGAC CAAGGAAGTT GGAAAATTTC AACTCCTCAA 540 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA TTTGACCTCG TTTTCTTAGA 600 

CCCTCCCTAT GCCAAGGAAC AAATCGTAGC AGATATTGAA AAAATGGCTG AGAGAGAGCT 660 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA GCCGTTGAAC TTCCAGAAGA 720 

AATTGCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA ATTAGTAAGG TGACAGTCTA 780

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC GATGACAAAT GGGCATCTGG 840 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA TGTGGGTATT TTTTTTAATC 900 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAACG CGGGTTACAA AAGGCTGTGA 960 

AACATTTGGG AAATG7TAAA GTCGTGTCTT CTCATGATAA ATTGGTGGTC GATGTCGCAA 1020 

AAAGACTGGG GGCTACTTCC CTAGTGCGAG CTTTGAGAAA TGCGTCGGAT TTGCAATATG 1080 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA TATAGAGACT ATTTATTTAC 1140 

ATAGTCGACC TGAACATCTC TATATCAGTT CATCAGGCGT TAGAGAGCTT TTGAAGTTTG 1200 

CTCAGGATAT TGCCTGCTAT GTTCCCG 1227 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

CCGGTCAAGT TAAAAACGCT ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACGT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 120 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA 180 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 240 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 300 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA CAAGCCAGTT 360 
ACACGGCTAA TTCGTCCACC TTTGACAAGA TTTTCCAAAG TTGAAACACC AATATAAAGC 420

TCTGTATGGT TTTTAACCTC TTCTACATGA GATAAAATTG CCTCCATATC TTTACCTTCT 480

TGAGCTAACT TCGCAGCCTC AACAACTTGG AATTTCAGGG CTTGGTCAGT GAAGGAACTA 540 

TCAACAACAG TCACATCTGC AGTAGATAGG CTAGCACCTT GGCGTCCTGC TTCTACCGTA 600 

CCCGAAAGAG CATGGGACAT ATGAATAGCA AGAATCTGGC CACCATCTTT GCATAGGTCT 660 

TCAAAAATCT CAGCAAAGAC ACCTACAGGT GGCTGACTTG TTTTCGGAAG ATTCTTACTT 720 

TCTTGCATCA ACTGAAGAAA TTTACCTTCT TCTTTCAAAT CCGCATCAGA ATAAACAACA 780 

TTATCAATCA TTACAGATAA TGGAACAATT GTAATATCTA ATTGCTTTAC TAGTTCAGGT 840 

TCAATAGTAA CAGATGAATC GGTTACAATC TTAATTTTTG TCATAGTATC AATCTTTCTA 900 

TTTTAGGATT CAGATTGGTT TCCTTACTTC TAATTATATC AAAAAAAAGA TTAAAAATCC 960 

TAATGGAGTC AATCAAATTT TCCGTAAAAT TTGATATAAT CAACTTATAA GAAAAGAGGT 1020 

GTCCTATGAT TAAAAAAATT TACCCCATTT TTACCATTTT ACTAGGTGCT GCTATTTATG 1080 

CTTTTGGACT GACTTATTTT GTAGTTCCCC ATCATCTCTT TGAAGGAGGG GCGACAGGCA 1140 

TTACCCTCAT CACCTTTTAT CTTTTTAAAA TCCCTGTTTC CCTCATGAAC CTGCTGATTA 1200 

ATATTCCCCT TTTCATCCTA GCTTGGAAGA TTTTTGGAGC CAAATCCCTC TATTCTAGTT 1260 

TACTAGGAAC CTTAGCTTTG TCCGGCTGGT TAGCTTTTTT TGAGCATATT CCCCTTCATA 1320 

TTGATCTTCA AGGTGATTTA CTAATCACAG CCCTTATAGC GGGAATCCTA TTGGGAATTG 1380 

GCCTTGGAAT TATTTTTAAT GCTGGAGGTA CAACTGGCGG AACTGATATT CTAGCTCGTA 1440 

TTCTCAACAA ATACACTCAT ATATCCATAG GAAAACTGCT CTTTATCTTA GATTTTTGTA 1500 

TTCTCATGTT GATTCTCCTA ATCTTCAAGG ATTTGAGATT GGTTTCCTAC ACGCTTTTGT 1560 

TTGATTTTAT TGTTTCTCGT GTTATTGATT TGATTGGTGA AGGAGGATAT GCCGGCAAAG 1620 

GCTTTATGAT TATCACAAAA CGTCCTGACC AACTTGCTAA GGCGATTAAT GATGACCTCG 1680 

GAAGAGGTGT TACTTTTATT TCTGGTCAAG GCTACTATAG TAAAGAAAAT TTGAAAATCA 1740 

TCTACTGTAT TGTCGGAAGA AATGAAATTG TGAAAACGAA GGAAATGATT CATCGAATCG 1800 

ATCCTCAAGC CTTTATAACT ATTACAGAAG CCCATGAAAT CCTAGGAGAA GGCTTCACCT 1860 

TTGAAAAAGA ATAAAAAGAG GTAATGTCGT GACCTCAAAA GTTAGACTAA ATCATCTATC 1920 

TTTTGGGTTA CAGACAACCT CTTTTTTATT TTATTTACTC AAGCTCTTAA GACCAATTCC 1980 

GAGTTACTTC TTCATCAGCC TTTAACTGAT CCACTAATTG GTCAACTGAG TCAAATTTGG 2040 

TCATATCTCG AATGCGATCA AGCCAATAAA CCATGACGGT TTCCCCATAA ATATCTTGAT 2100 

TAAAATCAAA AATATTGACT TCAAAACGTG CTTCTTCTCC ATCAAAGGTC ACATTTTTCC 2160 



WO 98/18931 



PCT/US97/19588 




CGACACTAGC CATAGCACGA TACTTCTGTC TTTGAATCTC AACATCAACA ACATAAACGC 2220 
CATCTGCTGG CATATAAGTA CGGTCTAAAA GCACTAAATT CGCTGTCGGA TAACCAATTG 2280 
TACGACCACG AGCATTACCA TGAACCACCA TACCTCTTGA TGGAAGCGCT GCCCCCAAAA 2340 

GTTTTCCTGC TTCTTTCACA TTTCCATCTA AAATAGCTTG ACGGATACGA GTTGAACTAA 2400 

TCTTTCCTTT CTCATCTTCT ACAGGTGGAA CAATGATAAC TTCTCCATCA AAGTAATTCT 2460 

TTAAATCTTC TGCTGTTTTT TTGTCAGAAC CAAATGTATA ATCAAAACCT GCAACAATAA 2520 

TTTTGGCATT CATAGCCTTG ATATAAGTTG CAAAGAATTC TTGTGCAGTG AGACTAGCGA 2580 

ATTGACTACT AAAATCAAGG AGATATAATT CTTCTACACC TTCGCGCTTT AATTTTCTTT 2640 

CACGTTCAGC AGGGTTCAAA ATATGCAAAA ACAAATCTGG ATGATAAGGC TCTAAAGCCA 2700 

TCTTTGGAGA TTCATTAAAG GTCATAACGA CGATAGGCAA CAAATCCTTT CTCGCAGCCT 2760 

TGTTGGCAAC ACGAAATAAT TCTTGATGCC CCTTATGTAT GCCATCAAAA TAGCCGAGAA 2820 

CAACGACTGA ATCAGATGGT GTGCCAATAT CTTTTTGGTT TTTTATAGGA ATAGTAATAA 2880 

TCATAAAATA ATTATATCAT AGCGATAGCT ATTTCTGGAA CAGAAAATCT GAAATGTTGT 2940 

TTTTTTCACA TGAAGTGTAC CTGTTTTCAA AAAGCACTTT ATTCTATCGT TGCTTAACTA 3000 

TGAACTTTGC AATATTCTTC TCAAAAACTT GTAGGACATC TTCAAAATTT TGCAAGGAGT 3060 

GATTAGACTT GTTCGGTAAC CATAAAGTGT CATACTATGC TTATGTATGA AAAAGCAATG 3120 

CAACTAACTC CTGAGAACTT TAAATTACTA ATTGGTGCCG AAAAGGTAGA ATTTAGAATC 3180 

GAGGTACACC TATGGCTGTA AAATTTACAA AATGAGACAA CTTGGGCAAG ATGTTTGAAG 3240 

AATTTCCTAA ACTCCCTGAT TTGAAGCAAG TCACTTTCCC TAATGACAAA GAAAAAACCC 3300 

AAAACAGCAA AGAAAAACTA GATGACTGCT TTCCAACAAC TCCCATCTAG TGTGCTTCAG 3360 

ACTGGGCTAT TTTTCTCTCC ATCTGTTACC TTGGATTCTC AGACCGTTTC AGCTAAAGAA 3420 

TATCTTTTCC CTTATCAGAA GGAACGGCTC AAGCCATTCA GACAAGTGAA GGGACGACAA 3480 

GCCAATATTT GAAACCAGAT AGCAGTTCTT ATAGTCAATT GAAATAAAAT CTGAAGAAAT 3540 

CGAGTAGGAA ACTCATATCA ATGTTTAACA GTGTTCTATT CCAGATTCAT ACTCAATGAw 3600 

AATTAAAGTG CAAACTAGGA AGTTAGCCGC AGGTGATACT TTGGGTACGG CA 3652 
(2) INFORMATION FOR SEQ ID NO: 252:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:

GTACCGTGGT GCCAAACTAC AGCAAGGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 60 

TTACATCAAA GAAAATGGAA ACTATGCTGA TAAAGAATGG ATTTTCGAGA ATGGTCACTA 120 

TTATTATCTA AAATCCGCTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 180 

TTGGTTTTAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 240 

TAGTCAAGCT TGGTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATCGATTTG 300 

GGATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATAGCTGAAA AAGAATGGGT 360

CTACGATTCT CATAGTCAAG CTTGGTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 420 

TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGGA AAATAGCTGA 480 

AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTGGTAC TACTTCAAAT CTGGTGGCTA 540 

CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 600 

AGGAAAAACT ACAAATGAAA ATGCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 660 

TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATGGCTAGA 720 

TAAGGATAGA AAAAGTGATG ACA 743 
(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:

TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAAGT TTCAGCGACC 60 

ATCATTTTTG AACAGTGATA GCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 120 

ATACCAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT 180 

TGGTTTCCGC ACTCAGGGCA AGATGGGGCG TCCTAGTCCA GTTTGGCGAT GATTTCCTTG 240 

TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT GTCTAGTAAT 300 

TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGTTGTTT TGTCGCTTTT 360

CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 420 

GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 480 

AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA 540 
TTAGTAAAGT AAAACTATTG GAGGATATTT TAATGCCACA ACCTATTGTT CCTGTAGAGA 600

TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 660 

TTGGCAAGCT TGAAGTAAGT TTTTTTCAAT CTCTCAATCT CGAAATGGTA GAACAGCTTT 720 

TGGATAAAGT CTTGCTCTAT GACAATTCAT CTATCTAGCC TAGGGCAGGT CTATCTCCTA 780 

TGTCGGAAAA CGGATATGAG GCAAGGCATT GATTCATTGG CTTATCTGGT TAAAACCCAG 840 

TTTGAATTAG ATCCTTTCTC CGGTCAAGTT TTTCTCTTTT GTGGTGGACG TAAAGACCGC 900 

TTTAAAGCCC TTTACTGGGA TGGTCAAGGA TTTTGGCTAC TATATAAACG CTTTGAGAAC 960 

GGAAAACTGA CTTGGCCCAG TACAGAAAAG GATGTCAAAG CTCTCACACC TGAACAAGTA 1020 

GATTGGCTTA TGAAGGGCTT TTCTATCACT CCAAAAATAA ATTTATCAGA AAGTCGTGAT 1080 

TTCTATTGAA ATGAGGACTT TCTTTTTAGT TATAATAAAG TTAGGAAATA AGGAGAGGAA 1140 

GCCCATGGAA GAAGATTGAA AATCATTCAA CAACAGAGTG CTACAATTGA TAGTCTCACC 1200 

AATGAACTTG CCCTTCTTCG TGAACAAGTG GCTTATCTAA CGCAAAAGCT CTATGGAAAA 1260 

TCCTCTGAGA AAACTGTTTG CCCATCTGGA CAACTCAGTC TTTTTGAAGA GGAACAAAAT 1320 

ATGGAACAAG ACTCTGACTT ACCCAGTTGA AAGAGAAGAA ATCACCTATA AACGTAAGAA 1380 

AGCTAAAGGG AAACGTCAAG CTCTTCTTGC CCAATTTGAT TCAGAAGAAG TTCATCATCA 1440 

AGTAGAAGAG AGCATTTGCC CTGATTGTCA GGGAGATCTA AAAGAGATTG GAGCAACCCT 1500 

TCAACGACAA GAATTAGTCT TTATTCCTGC GCAATTAAAA CGAATAGATC ATATCCAACA 1560 

CGCTTATAAG TGCCAAGCAT GCAGTGATAA AAATCCGAGT GATAAAATCG TGAAAGCTCC 1620 

TATTCCTAAA GCCCCTTTGG CGCATAGCCT TGGCTCAGCT TCTATTATCG CTCACACCAT 1680 

CCATCAGAAG TTTAATCTGA AGGTACCCAA TTATCCCCAA GAAGAAGATT GCGCTAAGAT 1740 

GGGTTTACCA ATCACACGTA AGGAAATTGC TAATTGGCAT ATCAAGGCGA GTCAATACTA 1800 

TTTGGAGCCC CTTTATAATC TTTTACGAGA AAAGTTGTTA GAACAAGCTC TTCTTCATGC 1860 

GGATGAAACC TCTTATCGGC TTCTAGAGAG TGATAGTCAG TTGCCTTACT ATTGGACTTT 1920 

TTTGTCTGGG AAAGCTGAGA ATCAAGCAAT CACGCTGTAC CACCATGATC AGCGTCGGAG 1980 

TGGTTTAGTA GTACAAGAAT TCCTAGGAGA TTATTCTGGC TATGTTCATT GTGACATGTT 2040 

GCGGCAGTAA CTTAGGACTT TAGTCCTCTA GTTCTGCCTA TGCGATAGCA GTCCAAGGTT 2100 

TAGCAGTAAG GCGACGCTAA GCTTGGTAAA CTGCCAACAG CTAGAAGCTT ATCGTCAACT 2160 

GGAAGAAGCT GCACTTGTTG GATGTTGGGC GCATGTGAGA AGGAAGTTTT TTGAAGTGCC 2220 

CCCCAAGCAA GCAGATAAAT CATCCTTAGG AGCTAAAGGT TTAGCTTATT GTGATCAGTT 2280 
ATTTTCCTTG GAAAGAGACT GGGAGGCTTT GCCAGCTGAT GAACGACTAC AGAAACGTCA 2340

AGAACATCTC CAGCCCCTAA TGGAAGACTT CTTTGCTTGG TGCCGCCGTC AGTCAGTTTT 2400

AGCAGGTTCA AAACTAGGAA GGGCAATTGA ATACAGCCTC AAGTATGAAG AAACCTTTAA 2460

GACTATTTTG AAAGACGGAC ATCTGGTCCT TTCCAATAAT CTACCTGAAC GCGCCATTAA 2520

ATCATTGGTT ATGGGACGGA GTAAAAGAGT CCAGTGGACT CTTTTAGCCT GAGCTCAGTT 2580

TAAAAAAGCG AGGGTGGTTA TTTTCTCAAA GTTTTGAAGG AGCTAAAGCA AGAGCTATTG 2640

TTATGAGCTT GTTGGAAACA GCTAAACGTC ATCAATTATA GTGCGTTGAA TCTATAACAG 2700

TACGCATCGA CTGCTAAAAC ATTTCTATAA ATCAATTTTC CTTTCCTAAT CGATTTGTTC 2760

ATATCTTATT TCAATCCATT ATAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC 2820

TTCCAAACGA GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCCTATTTA CCGTGGACTA 2880

AAGTTGTACA AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT 2940

CTAGTCTGGA GATTTTTCAA TAGACTTCGT TATTGGACGG TTACAATTTA TTATATGAAA 3000

ATCCCATATT ATTCTCCAAT TCTATATTTT ACCTTTCTAA ATGTATAGAT TAACTACCTA 3060

ATTATAGCAT ATAACGCAGA TTCCTTTCAA TCGTATGATT TACTGCATTA AATTAAGTAA 3120

AAAAATAAAG GCAGTCCGAA GACTGCCGAT ATTTATCTCT CATCTCTTTA ATTATGGTAA 3180

GTAAATAAAT AATTTCCCTA AAGATATGGA AATTATTAAT ACTATAAATA CATATTATAA 3240

AGTTTATAAA TACTGTAAAA ATCCTGAAGT TAATTTTCTA ATAAATATCA ATATGTGTTA 3300

GTATCTTTTA AATTTTTAGA CAATTTACTA GTTCTATAGA CATGTTTAAC AGACTCTATT 3360

TTACAATTCA AAAATTTCAT CTGCCACTTC ATTTAAAAAT TCTATATCAT GGGAAACAAT 3420

AAAAATTATT TTATCCATGG TTTTATACTP ATTAATCAGT TCAGATATTT TTATCATATT 3480

GGAATAATCC ATACCACTTG AAGGTTCGTC AAAAAAGACA AATGGAGAAT TCTTGCACAT 3540

AACAGATGCT ATTGCAAGCC TTTGCTTTTG CCCTCCTGAT AAACTCATCG GATGCCTTTC 3600

AATAAATTCG TCCAGGCATA AATCTTTTAA CCCAAATCAT TCATACCTCT CTCAACTAGA 3660

TGTAACTTAC AAAACCCCTG ACCTCATGAG ccAcriTcrr CCTCCTCATG AGGTCAGTTT 3720

TACTTTCTGC TGTTCCAGTA TCGTTTTTCC TCGCTAGATT TCCTCAAAAG GGCAGACTCC 3780

TCCCTTGGTT CGTCACACGA TTTTTTCATC TCGACTGTTC TTTAATGCAT CATTAACGAC 3840

GCTTTTCTTC TACGTGGTTC ATAAGGAACA GGAAGATTCA GGTTGACTTT TCTAATCCTA 3900

GAATAAAGTG CTGAAAACAA TTCGGAATAG GCATAGAGAC TAGACAATTT GAGGAGCTGC 3960

TTGCGTCCTG TTCGAACACA TTTTCCCACC ACGTGAAGAA AAAGATGGCG 4010


(2) INFORMATION FOR SEQ ID NO: 254: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2789 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

ATGCATCCGT TTCTCAAGCC TAAATTGTAA TTTTTTTCAA TTTAAAACAG AAAAACCCAG 60 

GAAAATCACA TAAAAATATC ATTCCTAGGC CTATTTATCC TATTTCTCTC TGAAAAATAT 120 

GAGTATTCAG TCGGTCAAAT GAAGCTGAAC GAACTCATTT TCCCTCGCCT AATTCAATGA 180 

TTCGATGACA TTGTTGGGCT ACATAAGCAT CGTGGGTCAC GATAATGACT CTTTTCCCCT 240 

CTCGATTCAT CTCTAAGAGA AACTTCAAGA CCAAATCTCT ATTTTCAGGA TCCAGAGAAC 300 

CTGTCGGTTC ATCGGCTAAA ATCAGCTGGC TGGGTTTTAA GATGCCTCTA GCAACTGCAA 360 

TTCGTTGTTG TTCGCCCCCA GACAACTCGG AGACCCTTTG ATGCAAAGTA GCTGATAAAC 420 

CTACTCTCTC TAAAATCTCT TCCACCTTTT TGAGCTTGTC TTTCTTAGGC AATTTCACAT 480

ATTTCAGCGC CACATGAGAT TGTACTCGAC CGTTTCATCA TCAATCAGGG CAAAATTTTC 540 

AAACAGATAA GAGATATGTT CACGGATTAT TGTTTGCGAC TTAGCAGAAT TAACCGCTAG 600 

ATTTGTCTGA CCAAAAATCT CATACCGTCC GCTATAATCA CCATCTATCA AACCCAATAA 660 

ATTTAACAAG GTCGACTTCC CACTACCACT CTTACCAACA ATAGCTACCA AATCCCCCTC 720 

ATCAATCCTG AGAGATAAGT TATCCAAAAT CACTTTTCCC CCAATGGTTT TGGTAATATT 780 

TTTCAACTCA ATCATAAGAT GCCCCCTTTC AATAACTCTA CTAGACTTCT TTTCTCCATC 840 

CTAGAAGCTA AGCCTAGCAC AAATAGTATA TCCAGACATG TAAAACCTGC AAACAGTAGA 900 

AGTGGTAAGA ACGCATGGGC AAAGAAAATC AAGACTAGAA GAGGGAAACT ATAGCCCAGC 960 

AAGAGCAGAA CGAGGAGAGG ACGGTAGCGA TCGACCAGTT TCCACCCCAT AAACTTCTTG 1020 

GTAATGATAT CCCTGCGCTT CAATAAGAAA GTTGTTACTA GTAAGAAGTA GGAAATCATC 1080 

ATGCTAAGGA GACCAAACAA AGCAAAGAGT AGGTTAAAAT TCCGAACAGC ATCTCGATAA 1140 

GAATCCACTT TCTCTTGTTG AATGGCTTGA ATAGATGAAA ATTTTAAATA ATTTCCATCT 1200 

GACAATTTCT CAACTAACTC TGTAATCTCT TTTTGATGTT GAACCGTATT TTCAATTTTA 1260 

ATCGGATTAT TTAAGCCAGT TGTTGACAGG CAGGCTTTCT CATCCCACAT CATATCAGAA 1320 

TCATTGACCA AGCTAATAAT TGGATTGGAG AGATTTTCCT TTCGCTTATC ACTATATGGG 1380 

AAAAATGACC AATCTCCTTC ATAATAGGCA ATCTCGACAT CCATCTCCTC TATCGTTCGT 1440 
TTTTGCTGCT CTTCATACTT CATCGAATGA AAGGCAATTA ACTTCCCCAA GAGCTGATTT 1500

TTATCTTCTT CACCTTTCGT ACTTGCTGGC ATCAAAATAA CTTTTTTAAT ACCGGTATTT 1560 

GGTAGCTTGA ATCCCTTGCT CTTTAGAAAA TTCCGATTGG CATAGTAAAC ATCCACCGTA 1620 

TCTGTTAACT GATATTGCTG AATCTGTTCT GATTGGACAA AATTTTTTAC AGGAAGACTG 1680 

CTACTCTGCA CATAGCCCGC CTGCGTTTTT TCTACCAAAT CCTGATAAAA TCGATAGAAA 1740 

TAATCTGTAG ATTTCCCTGA CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 1800

AAGGTTTCTA AGGTCAGGTA ATTACCTTGA CTTACCCACT GTTGCTGATA AGCAAGTTCT 1860 

TTGTTTTCTT GTTCTAAACT TCTGCCCACC CCAATCAGTA AGGCCGTCAG TAAAATAGTT 1920 

GTCCCTATTT TCATCACATA ATTGAAGATA AGACCAAATT TGAAAGATGA AAAACCTTTC 1980 

AGCAGAGAGC TGATTGTCAT TTTTTGGATT AAGAGGTAAG TCAACCAACT GATAAAGAGA 2040 

TAAAGCTGCA ACAGCAAAAA ATGAGACAAC CACAGCATAG CAAACAAATC TTTTGGCTTA 2100 

TAATCAAGCA AGAAAAACAC GCCTAGATTG ATCACAAGAG CCCCACCTAG GAGGAGGTAA 2160 

AGGTTGCCTT TTACAACATC AGCTAAAACA GCCCTATCTT GAAAACCAAG TAATTTTTGT 2220

ACCCCAACTC TTTTCATCTC CATCATCGGT TGATACACTG TCACTAACAC AAGAAGCAAA 2280 

ATAGCCAAGA CAAAAACAAT GGCAGATAAA AGCAAATCTC GATTTATGAC TTCCACTGCA 2340 

CTTTTGTAGG TCGGCTCTAG CAAGGTAGCC TGGTCTATCT TGAAAAAATC GCTCCATTTC 2400 

TGTACAATCC TATOCTTGTC CATCTCTTGT GTAGAAGTTA TCGTATAGCG ACCATTTAAA 2460 

CTACGAGATG TATCCTTGAT ATAGGTTTGA AAAGTCATAA GCTGAATAGG TTTGGCTTTT 2520 

AGAAAGGTCG GAATCGTACC AAGTTTATTG GAAATTTCTT TATTACTATA GACTCCTTCA 2580 

CCATCTGTGG TAAAATCAAG AGAAGAAATC CCAAACTCTT GGTAGGGGAA GGTATCTTTA 2640 

TCAAAAACAC CAGACTTGAC CACCTCATCA CCACTGTCTG TTTTGATGAT GGAGACTTTA 2700 

TACTCCTTTG ATACATCCTC AAAAAATCGA AGAACAGACG CTGCAGGTTC GTTAATATCT 2760 

TTCAAATACA AATCCAAAGA ATCTACAGG 2789 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
CTGCGAATTT TATTAAAGAT AATGTGTTAA TTACAGCGGC TCACAACTAC TACAGACATG 60

ACTATGGGAA AGAAGCGCAT GATATTTATG TTCTTCCGGC TGTTAGTCCA AGTCAAGAAC 120

CATTTGGAAA GATCAAAGTA AAGGAAGTTC GTTATTTGAA GGAATTTAGA AATTTAAATT 180 

CTAAGGATGC AAGGGAATAT GACTTGGCTT TATTAATTCT AGAAGAGCCC ATTGGTGCAA 240 

AATTAGGGAC TTTGGGTCTT CCTACTAGTC AAAAAAATTT GACAGGAATA ACTGTGACTA 300 

TCACAGGCTA TCCATCATAT AATTTTAAAA TTCATCAAAT GTATACAGAT AAGAAACAAG 360 

TTTTAAGTGA TGATGCCATG TTCTTGGATT ACCAAGTTGA TACTTTAGAG GGGTCTAGTG 420 

GATCTACAGT TTATGATGCT AGTCACCGTG TAGTAGGAGT GCATACTTTA GGAGATGGAG 480 

CTAATCAAAT TAACAGTGCA GTTAAATTAA ATGAACGAAA TTTGCCATTT ATTTAwTCGG 540 

TTCTTAAAGG TTACTCTCTT GAAGGATGGA AGAAAATAAA TGGTAGTTGG TACCATTATA 600 

GACAACATGA TAAACAAACG GGTTGGCAGG AGATAAATGA TACCTGGTAT TATTTAGACA 660 

GTTCCGGTAA GATGCTTACA GATTGGCAAA AAGTCCATGG AAAATGGTAT TATCTCAATT 720 

CAAATGGAGC AATGGTTACA GGTAGCCAAA CTATCGATGG TAAAGTTTAT AACTTCGCTT 780 

CATCTGGTGA GTGGATTTAA TGTTGGAGGA TATATAAAAT GAAGCTTTTG AAAAAAATGA 840 

TGCAAATCGC ACTAGCCACA TTTTTCTTCG GTTTGTTAGC GACAAATACA GTATTTGCAG 900 

ATGATTCTGA AGGATGGCAG TTTGTCCAAG AAAATGCTAG AACCTACTAC AAAAAGGGGG 960 

ATCTAAAAGA AACCTACTGG AGAGTGATAG ATGGGAAGTA CTATTATTTT GATCCTTTAT 1020 

CCCGAGAGAT GGTTGTCGGC TGGCAATATA TACCTGCTCC ACACAAGGGG GTTACGATTG 1080 

GTCCTTCTCC AAGAATAGAG ATTGCTCTTA GACCAGATTG GTTTTATTTT GGTCAAGATG 1140 

GTGTATTACA AGAATTTGTT GGCAAGCAAG TTTTAGAAGC AAAAACTGCT ACGAATACCA 1200 

ACAAACATCA TGGGGAAGAA TATGATAGCC AAGCAGAGAA ACGAGTCTAT TATTTTGAAG 1260 

ATCAGCGTAG TTATCATACT TTAAAAACTG GTTGGATTTA TGAAGAGGGT CATTGGTATT 1320 

ATTTACAGAA CGATGGTGGC TTTGATTCGC GCATCAACAG ATTGACGGTT GGAGAGCTAG 1380 

CACGTGGTTG GGTTAAGGAT TACCCTCTTA CGTATGATGA AGAGAAQCTA AAAGCAGCTC 1440 

CATGGTACTA TCTAAATCCA GCAACTGGCA TTATGCAAAC AGGTTGGCAA TATCTAGGTA 1500 

ATAGATGGTA CTACCTCCAT TCGTCAGGAG CTATGGCAAC TGGCTGGTAT AAGGAAGGCT 1560 

CAACTTGGTA CTATCTAGAT GCTGAAAATG GTGATATGAG AACTGGCTGG CAAAACCTTG 1620 

GGAACAAATG GTACTATCTC CGTTCATCAC GAGCTATGGC AACTGGTTGG TATCAGGAAA 1680 

GTTCGACTTG GTACTATCTA AATGCAAGTA ATGGAGATAT GAAAACAGGC TGGTTCCAAG 1740 

TCAATGGTAA CTGGTACTAT GCCTATGATT CAGGTGCTTT AGCTGTTAAT ACCACAGTAG 1800 
GTGGTTACTA CTTAAACTAT AATGGTGAAT GGGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAAGT CTTCACAATC AAAAAACGCA 1920 

TAGTATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCGAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 2040 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTGC TTAAGCCACT GAAACGACTT 2100

CTCATAACCG TAAAACATTT CCGATTTCAG AATCCCAAAG AAGGACTCCA TCATACTATT 2160

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2280

GTGCTTCTCT GTGAATGCCT GTTCCAACAT TGTTTGTACT TGTTCTAAGT TGGGTGAAGT 2340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 2400 

CTTTTGAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2460 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG 2495 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

TACCACCGTA TTCATCCAGC AAGATTGCCA TTTGTCTTTG GGTATTTCGC AGTTCTTTTA 60 

GCAAGTCATC CACAAAAATA GTTTCAGGTA CAAAAAGTGG ATCTTGTAAA ATTCTCTTCC 120 

AAACAATATT GTCAAAACCG TCCACAAAGC CTGCCTTAAG GAGACTCTTG GTGTGAATGA 180 

TTCCAATTAC ATTGTCCTTA TCCCCATCAT AAACCGGGAT ACGAGAATAA TTTTGTTTTA 240 

AAATACTTTG GATAATGGCT TGACTATCAT CCTGAATATC CACCATAAAG GCATCCGTTC 300 

GAGGAACCAT AACCTCTCGT GCCATCAGTT CATCGAGCGA AAAGACACCT TGTAGCATCT 360 

CAATCTCATC AGCATCCAAT GTTTCTTCAC TATTTGTCAG CATATAGGCA ATTTCATCAC 420 

GGGTCATCTT TTCATCCGCA TCATCGAATG ACATAGGAGT CAAATGGCTC AAGAAATTGG 480 

TCGAAGCAGC TAAAAGCCAA ACAAAAGGAC TGACTAGTTT TCCGATCCCA ATGATAATCG 540 

GCGCTGTACG AATTGCCAAG GCATCCTTTA GATTAAGAGC GATTCTCTTA GGATATAATT 600 

CCCCAAAAAC GATGGAAATA TAGGTCAAAA ATGCCAAGGA TAGAAAAGTT GCCACGGCTT 660 

GTGCTGTTTC GCCATTCCCA AGCCAAGAGG CAATCACACG TCCTAGAGTA TCAGTTAAAC 720 
TCGCCCCTGA TAAGATTGTA ATCAGGGTGA TTCCTACCTG GATGCTTGAT AAAAAGTGGT 780 
TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCCTCT GTCTCCTTCT TCCGCCTTTT 840 



GTTCAACTCG GGCACGATTA AGAGAAACGG 870 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA 60 

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 120 

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGCACTACAG TGGAAGCAAG GAACTGGGGA 180 

ATGCCCAAGA CTATCTCTAT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 240 

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAAGCAGA AGATACTGGT AAATATGAAC 300 

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCCTTTTC 360 

AAAAAATTGC ACTTTCCTCT TGATTTTTTT TGAAAAAGTG GTATCATATA AATATAGAAA 420 

CGCTGTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTCTA TTATTAGGGA 480 

AACAAAAGTC TTCTAACAGC ATGTAGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 540 

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 600 

CAGCTTTTTT CTTTGCCTCC TTACCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 660 

TCACAGGTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 720 

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGTGTTTT TACAACAAAC 780 

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTCAA 840 

TCAACATGGA GATATCTTTA TCTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900 

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT TCAGAGCGAT 960 

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 1020 

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080 

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 1140 

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCCTTAT TCTGAAAGTG 1200 
AAACAGCAAG AGTTGATAGA GCTGATAGTG ATGTTTCAAG TCTTG 1245 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
ATGCCTATGT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT AGTTTGTCTG 
AAGCTGAGAG AGCGGCAcCC AGGCTTATGC TAAAGAGAAA GGTTTGACCC CTCCTTCGAC 
AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA GAAGCTATCT ACAACCGCGT 
GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC AATCTTCAAT ATACTGTAGA 
AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT TACCATAACA TCAAATTTGA 
GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT ACTCTTGAGG ATCTTTTGGC 
GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG CATTCAGATA ATGGTTTTGG 



TAACGCTAGC GACCATGTTC 
AAAACCAAGC GAGGAGAAAC 
GAAACCGCAA AGCGAGAAAC 



AGAGGAATCA 
TGAAGATTTA 
CACAGGATTA 
AGCTGAAAAA 
TCCTAAAAAC 
TTCTCATTTT 
TCTACTTCTA 
ATATCTTAAC 
CACTAGTTAA 
AGCTCTTGTC 
GGATTACTCT 
TTATCGTTTC 
CTAAAGATGT 



GAAGAACCTC 
CTTGGAAAAA 
AAAAATAATT 
CTATTGGCTT 
AGGATAGGAG 
TTTCATGAAA 



AAAGAAACAA AAATGGTCAA 
CTCAGACAGA AAAACCTGAG 
CAGAGTCTCC AAAACCAACA 
AGGTCGAGAC TGAAAAGGTT 
TCCAGGATCC AATTATCAAG 
TACTATTTGG CACCCAGGAC 
TATTAAAGGA GAGTAAGTAA 
AACGGGAAAA CGAAAAATGA 
ATGTGCAAAA TATAGTAGAT 



GCTGATACCA 

GAAGAAACCC 

GAGGAACCAG 

GAAGAAAAAC 

TCCAATGCCA 

AACAATACTA 

AGGTAGCAGC 

GAGCAGAATG 

TGAAACTAGA 



ATCAAACGGA 

CTCGAGAAGA 

AAGAATCACC 

TGAGAGAGCC 

AAGAGACTCT 

TTATGGCAGA 

ATTTTCTAAC 

TGAGTTCTAG 

ATAGTATACC 



AAACATTGTT AGAAATCGAT TTGACTGTCC 
AGATAGTGTA AATAAAGATA AACTATTTAC 
GGAGTAATGA TGAAAAAAAG AACAATACTA 
TTAGGAGCAT GTGGTTTCTT GGACATATTG 
TTACTGCTAT TTTAGAAACT GGGGTGGTTT 
ACATTCCAAA CACATTGCAG AAGGTGTTCT 
TCCGATTACT TATCTAAGAG GAACCGAGGG 



TGTTCTTATT 
TGGCTAATTA 
TTATTGATGG 
ATCCTGGATC 
GATGGAAAGT 
TGAACTGATT 
CGGAGGAATT 



TCATTTTACT 

ATCAGTTAAA 

CCAGTCTGTT 

ATTCTCATCA 

ATTGGTCTTG 

AGTAAAGTAG 

GGAACGAGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTG 1440 

ACCTAGGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTCAGGCTG 1560 

GTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGGAgCt TGAAATCAAT AAATAAGGAA 1620 


TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTGaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

AGGAGTGGAG AnATATGAAG ACACAAATTT TCACATTATT GAAAATCGTT GCTGAGATTA 60 

TTATTATTTT GCCATTTCTA ACTAATCTAT AAGTTCTTTA TATTGCTGAA AACGCAATTC 120 

AAAAAGGGCT ATTAATTGTG GATTTTCTAA TACCTGCAGA GATTGGATAA AGCGTTCAAT 180 

CTCTTTTTGA TTGCTTCCCT TTGTTTGAAG AAAGACACTC ATCTTCTTTA AAAATTGCCA 240 

CGATACTTTT TCAAAAACAT CATACGGTCG TAACATCCTC TCCAACTCGG CTTCGAAGAT 300 

TGGGATGTAG GAGAAAAGTT TTCGCTCCAT GAGTTCTGAT AAGATATTTA AGAGTCCTTG 360 

CTTCATATAC AATCGATTGT GTACTAACTC TTTAAATTCT TTGGATTTTT CGAGTAAGGA 420 

GGTTGATAAA AAAATCAGAT CTTGATTGCT CAAGAAGGGC ATGGTATTGC AAAAGAGATA 480 

GAGTTCAAAC CAGGTCCAAG ACTCGATAGC ATAGAGATAG GTGGTCAAAA ACTCGCTATC 540 

CTCCTCTGCT AGTGGGTAGC TTTTATTTAG TGAATGGATG GCATCTTTAA TCACGATGGC 600 

ATTCAAACGA CGATAGGTCT GCGCCATCTG TTCTTGATCG ACTTCCTCCA ATAGCTGCTC 660 

TAAAGCAGCT ATATCCTGAT GGGCAAAGCG ATTCACAACC TTTCGACCGA TTCGCATATG 720 

TGGAGATTCT TGATAGTTGT TGAGCTTGTG CCCAAACTCA TCAAAGGTCA CATTTATACC 780 

TTGGATAGCT AGAATCAACT TATCCGCAGA CAGCATAGAC TGCCCTAGTT CAAACTTGGA 840 

CAACTGAGAA GCTGTTAGAC CCTCACAAGC CACATCTGAC TGCTTGAGCT TTCTCGCCAA 900 

ACGTAATTCC TTGTAAAATT CCCCCAGTTC CATTCTCTCA ATCATCTGAC CACCTCCTAG 960 

CTTTTGCAGG 970 
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

GTTGACCACG GCTAAAACTA CCCTAACTGC AGCTATCACA ACTGTTTTGG CACGTCGCTT 60 

GCCTTCATCA GTTAACCAAC CTAAAGACTA TGCGTCTATC GATGCTGCTC CAGAAGAACG 120 

CGAACGCGGT ATCACTATCA ACACTGCGCA CGTTGAGTAC GAAACTGAAA AACGTCACTA 180 

CGCTCACATC GACGCTCCAG GACACGCGGA CTACGTTAAA AACATGATCA CTGGTGCTGC 240 

TCAAATGGAC GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG 300 

TGAGCACATC CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 360 

AGTTGACTTG GTTGACCACG AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT 420 

ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 480 

AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA ACACAGTTGA 540 

TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA 600 

CGTATTCTCA ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 660 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT 720 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 780 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA TCGCTAAACC 840 

AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA 900 

AGGTGGACGT CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCCTACTAC 960 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA 1020 

CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAGGTA CTACATTCTC 1080 

TATCCGTGAG GGTGGACGTA CTCTTGGTTC AGGTATCGTT ACAGAAATCG AAGCTTAATT 1140 

CGATTTAGTT CCCAGAAGAA CAATTATTTA AGTTAGACAC TAAAAGAATC TTGCTTGGCA 1200 

AGGTTCTTTT TTTAGATATT GAACTAATAC TCAATGAAAA TCAAAGAGCA AACTATAATA 1260 

TATTGAAACT AGAATAGTAC ACATCTACTT CTAAAACATT GTTAGAAATC GATTTGACTG 1320 

TCCTGATCGA TTTGTCTTGT TCTTATTTCA TTTTACTATA GAAAGTTAGC TACAGACTGC 1380 

TCAAAACATT GTTTTTAGGT TGTAGATAGA ACTCACGAAC TCAGtAACAT CTATACGACA 1440 
AGCCGAACCT GACGCGGTTT GAAGAGATTT TCGAAGAGTA TAATACTAGA CTAAAATCAA 1500 

AAAGCATTAT ACAATAGTAA TATGAAATCA ATTAAAGAAG AAATCCAAAC CATCAAAACA 1560 

CTTTTAAAAG ACTCTCGTAC AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTCGT 1620 

CTGATGCGCA AATCTTATAA AGAGATTATA GAACTTTTAT AGTGGTTTGA AATAAGATGT 1680 

GAACAACTCT ATCAGGAAAG TCAAACTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA 1740 

CTGTTATAGA TTCAATACAC TTTAGACTGT AATCAAACAA CGATTTGGCG AAATGTAAAA 1800 

AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC ACGTCGTGGT CGTAACCATG 1860 

CTTATATGAC CGTTGAGCAA GAGAAAGTCT TTCTTGCCCG CCATTTGAAG GCTACAGAGG 1920 

CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA TAAAAAGGAG TTAGGTCGTT 1980 

CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA TGGTTGGCGA AATATTACGC 2040 

CACGTCCAGA ACATCCTAAG AAAGCAGATG CTCAAACCAT TGTCGCGTCT AAAAATAAAG 2100 

TCTCAATTCA AGAAGACAAG TGAACTGCAC CCCAAAAGTT AGACAGAAAA AATCTAACTT 2160 

TTGGGGTGTT TTTATTATGA AATTAACTTA TGATGATAAA GTTCAGATCT ATGAACTTAG 2220 

AAAACAAGGA TATAGCTTAG AGAAGCTTTC AAATAAATTT GGGATAAACA ATTCTAATCT 2280 

TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG TTCGTCAAAA AAGGAAAAAA 2340 

TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT AAAGTCTGAC ATGAAGGCTG 2400 

GACTAAAGAT AGAGTTTCTC TTGAATACTG TCTCCCAAGT CGTACGATAC TTCTTAACTG 2460 

GCTAGCACAA TACAGGAAAA ACGGGTATAC TATTGTTGAG AAAACAAGAG GGAGAGTACC 2520 

TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT CCGATTGAAG GAGGAAAAAG 2580 

AGAAAGAAGA AAGACAGAAA TTATTCAAGA ATTAATGACT GAGTTTTCGT TAGATATTCT 2640 

TCTAAAAGCC ATTAAACTAG CTCGTTTGAC CTACTACTAT CACTTGAAAC AGCTAGATAA 2700 

ACCAGATAAG GACCAAGAGC TTAAAGCTGA AATTCAATCC ATTTTTATCG AACACAAGGG 2760 

AAATTATGCT TATCGTCGGA TTTATTTAGA ACTAAGAAAT CGTGGTTATC TGGTAAATCA 2820 

TAAAAGAGTT CAAGGCTTGA TAAAAGTACT CAATTTACAA GCTAAAATGC GACAGAAACG 2880 

AAAATATTCT TCTCATAAAG GAGACGTTGG CAAGAAGGCA GAGAATCTCA TTCAAGGACA 2940 

ATTTGAAGGC TCTAAAACAA TGGAAAAGTG CTACACAGAT GTGACAGAAT TTGCCG 2996 
(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 60 

GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTGCAGTA GCATGATTTC 120 

TCTCATTTCT ATCTTTTTTA AAGAGTAAAC TCAGCTAGTC CAACTAACTC AGTTTTCCTT 180 

TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 240 

TCTTTAGGGG TGTGGATAAT CTCATCCGGA GTTCCAGACT CGATGATTTT CCCCTTATCT 300 

AAGAAGAGAA TTTTATCCGC AACTTGGGCT ACAAAGGACA TGTCATGACT GACCAAAATC 360 

ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC GACCAATTCT 420 

GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 480 

CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA ATGGTTTTCA 540 

CGGTCCGAAA GCCCAACCTT AGCCAACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 600 

GATAATTTCT TGACAACAAC CAAGCCTTCT TTCACATTAT CAAGTGCTGT TCGGCGTTCA 660 

AACAAATTAA ACTGTTGGAA AACCATAGAC AACTTACGAC GTAGGGCAAG GATTTCTTCT 720 

TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 780 

GTTTCTAGAT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 837 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

CCGAACAAAA TGGGCTAATT AGATTATAGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 60 

ATTGGATGTG ACCACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 120 

AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 180 

TTTGGTAAAA AAGTAGCGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATCTATC 240 

TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 300 

TTGGTTCGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 360 

GGTTTTGGTG GTAAAATTAC TGGTGAATTG CTTATGTGTG ATATCATCGA AGCTTTCATC 420 
CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 480 

GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 540 

GATCGTGGAG AATACCACGA CTAAGAGGTG ACCTATGATT TTAACAGTCA CAATGAACCC 600 

ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGGT 660 

GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATGTT ACCCGAGTAC TTTCAGAATT 720 

TGGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 780 

ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTG 840 

TATCGCTATT CTCCACGGAG ACAACCAA 868 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

CCGTTCAAAG TCTTCATAAC ACTCGAAAGT CACAGTTCTT TCGTTCTTGC TGGCATCTAT 60 

ATAGGTAATT TCAATCATGT TTAAAACTCC TTTGTTTAAT GCTAACTTTA TTTTACTCCT 120 

TATAAAAGAG AATGTCAAGA AAAATGATTG CGCACGCAAC TTTTTTTAAA ATCATCTTAA 180 

ATCAAGAAAT CCAAACCTGC TTCCAAGCTT TCTTCGACAG TCTTTTGTAG CGAGGCCAGT 240 

GTCTTTTGCC CATCATTTGT CAGGCAGATA AAACTAGAGC GTCTATCTTC ATGGCAACAC 300 

ATGCGACTGA GTAGACCGCA ATTTTTAGCT TCCAAGCGAG CCACCATCCT AGAAACTGCG 360 

CTCGGGCTCA GATGAAGCTT ATCTGGCAGG TCAATCTGGC GTAGAGATTT TTCTTCAGCC 420 

AAGTCCAGAT AGTAGAGCAG GTAGAACTCT TTCAAGGTCA GACTTTGCTC GCTCTGTTGG 480 

GCAATGGTCT CTTCCAAGAG ACTTTCAATT TCTTTCTGAC GCCGATTGAA GTCAAACCAT 540 

TTTTCCAAAT AGGTCATAGT GTCTCCTTTC TTTTTAGAGT CATAAATAGA AGAAAGTCCA 600 

TTAACGGGCA GTCTCTGCGT CACAAGATGA TTGCGCATGC AATAATTATA CTACTTTTCA 660 

AGAATGCTGG CAAGCTCTGT TTTTTAGTGG TTTTATTTTT GTGTGAATAA TGGGGGAATC 720 

CTATTGTTTC AATTTCTAAC TCCTTATCAC ATTCGAATTC AGATTTTATT TCATTTCTCT 780 

ATCTATAGTT GCTTAGTTTA AAATAAGCAT GGTCTAATAA AGCTATGCAT ATAGTACTGA 840 

TTTTAAACAA GGAGCATTAG ATTCCATTAA AGGAGGGCAC AGACATGTCG AGGCGGCCAA 900 
AGTTTTTCAT GTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG TAAACAAGGG 960 

AACTTAGAGC GGAAAAAGCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA AGAATTGAAA 1020 

GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG TTTTGATTGT 1080 

GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA AAAGACGACC 1140 

AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT GGATAACCTA 1200 

AAAGATTTAC CAGTCCTATA TATTGACGAA ACGGGAATCG ACCGCTACCT CTATCGTCCT 1260 

TATGCAGGGC CTCCTAGAGG GGAGAAAGTC TATGGCAAGA TTAGCGGACG GCGTTTTGAG 1320 

CGGACTAATG AGGTGGAGCA AAAACTCAAT GGTAGTTTTC TAATCAGATA TATTGATTCA 1380 

CAAATTAGAG AATGAAAGAA TAATTATGCA TAAAAATAGG AATATAAACC AAAAATTAGC 1440 

TGATTTATAC TCATTTGCGT GTCTTTATAA AAAACTTATC TTATAATATA TATATATATA 1500 

TATACAAAAT AGTAAAATGC TT riTO TT'JT TAGCAAAAAT ACCTCAAGTT TCTTGCTATT 1560 

TTGGGTTCCC TATTCTATAA TTATAGTATG GTAATTTATT TATATCCATA CATGAAAATA 1620 

ATACTCGAAA GGAAATTTCA AAATATTTTT TAGACGTCAG AAGGGTGAAT ATAGAGAAAC 1680 

AGACCGAGTA ACTCGGTTCA AATTAATCAA ATCAGGGAAG CATTGGCTAC GGGCCTCGAC 1740 

TTCTCTTTTT GGCTTGTTTA AGGTCTTGCG AGGTGGTGTT GATACTACTC AGGTCATGAC 1800 

CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGG CTTGATATCC TCAAGGGGAT 1860 

AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT CAAACGAAGG TATTTACAAA 1920 

TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT GCTTTGGCAA CAAATGATAC 1980 

AGTAGTTCTA GCTAOGATAT CTACAAGTAA TTCAGCGAGT TCAACTAGTT TGTCAGCTTC 2040 

AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGCGCTTCGA CCTCAGCAAG 2100 

TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCGACA AGTATTTCTG CATCATCTAC 2160 

TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGG TCGAAGAAGA 2220 

TCGTAAGAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 2280 

TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT CTATAAAAAA 2340 

TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA ATGCAAGTCT 2400 

AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT CGGTATATCG 2460 

TGTTCCAATT TACTATAAAT TGAAAGTGAC AAATGATGGT TCAAAATTGA CCTTTACCTA 2520 

TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA GTATGCGTCC 2580 

TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC TTGGCAGTGA 2640 

TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA GACAGGTTCT 2700 
ATCCTATAAT ACATCTACAA TGACGACGCA GGGTAGTGGG TATACTTGGG GAAATGGTGC 2760 

CCAAATGAAT GGTTTCTTTG CTAAGAAAGG ATATGGATTA ACATCATCTT GGACTGTACC 2820 

AATTACTGGA ACGGATACAT CCTTTACATT TACCCCTTAC GCTGCTAGAA CAGATAGAAT 2880 

TGGAATTAAC TACTTCAATG GTGGAGGAAA GGTAGTTGAA TCTAGCACGA CCAGTCAGTC 2940 

ACTTTCACAG TCTAAGTCAC TCTCAGTAAG TGCTAGTCAA AGCGCCTCAG CTTCAGCATC 3000 

AACAAGTGCG TCGGCTTCAG CATCAACCAG TGCCTCGGCT TCAGCGTCAA CCAGTGCGTC 3060 

AGCTTCAGCA AGTACCAGTG CTTCAGTCTC AGCATCAACA ACTCCTTCAC CCTCAGCATC 3120 

GACAAGTGCC TCGGCTTCAG CAAGCACATC AGCATCTGAA TCAGCGTCAA COVGTCCTTC 3180 

GGCTTCAGCA AGTACCAGTG CTTCAGCTTC AGCATCAACC AGCGCCTCGG CCTCAGCAAG 3240 

CACCTCAGCT TCTGAATCGG CCTCAACCAG CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC 3300 

TGAATCGGCC TCAACCAGCG CCTCAGCCTC AGCATCAACC AGTCCTTCGG CTTCAGCAAG 3360 

CACAAGCGCC TCGGCTTCAG CATCAACGAG TACGTCAGCT TCAGCGTCAA CCAGTGCTTC 3420 

AGCCTCAGCA TCAACAAGTG CGTCAGCTCA GCAACTATCT CAGCGTCTGA ATCGGCATCA 3480 

ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT CAGCAAGCAC CTCAGCTTCT 3540 

GAATCGGCCT CAACCAGTGC GTCACCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA 3600 

CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG 3660 

CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC GGCTTCAGCA TCAACCAGTG 3720 
CCTCGGCTTC AGCGTCAAAC AGTG 3744 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
CGATAAAGAG GCCTTGAGTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 

GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTATCATTTC 120 

ACCCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCGGAAAATC GCTTGGCTAT 180 

TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 240 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGAGT AGATCTGACT TGGAGGCTAG 300 
TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGG TTATTGAAAC 360 

TCTTTCTCAC GGAATGCGTC AGAAAGTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 420 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGCATCCC CAGGCTGCCT TTGATTTGAA 480 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 540 

AGAGGTGGCA GACCAAGTCT GTGATCGGAT TGCCATTTTG AAAAAGGGGC ATTTGATTTA 600 

TTGTGCTAGT GTAGAGGACT TGAGAAAAGA TTACCCAGAC CAGTCTTTGG AAAGTATCTA 660 

CCTTAGTCTT GCTGGTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTPGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAAGA 780 
AGCAGGCTAA GAATC 795 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

TGGTAATGTG CTTGGCAGCw TCCTTGACAC TGCTACTACC ATTTCCCATA GCGACCGACA 60 

TACCAACGCC AGCCAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120 

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT 180 

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240 

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 300 

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 360 

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 420 

AAGAACCCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAG/.G 480 

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 540 

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAAGATATA CTGGCCATTA TAGGTTACCG 600 

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660 

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CCCATCCTTA GTGGATTTCA 720 

AAACACTCTT GCGATTGTTG ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780 

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840 

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATGCAG 900 
GCCGATCCTT 
CCACTTGAAG 
AAACCCAAAC 
GGAGTGGTGA 
ATGGAAGTTC 
GTAGTGATGG 
CTGACAAAGG 
AAACACATAC 
TCATGAAACA 
ATCACATACA 
TTGATATGCT 
ATATTCCCAA 
CTCTGGTATC 
GCGTACAGTT 
GCTTGAAACT 
TGGAAGGCTA 
ATGGtACTGA 
TCAGTTCTAT 
ACCTCAAAAA 
TTTGAGCTGA 
TCAGTTCTAT 
ACCTCAAAAA 
TTTGAGCTGA 
CTGGCGCTGT TGTGACGAGT TCAGATAATC 
GAATCCAAGT CCACGCCTTG GACTTTTTTA 
ATATGCTGCT CCTCTACCAG AAAATTAAAG 
TCACACACGG AACCGATACT TTAGAGGAAA 
CCCATATGCC TATCGTTCTA ACAGGAGCCA 



TGTTTATAAT TACCTAAGTG CTTTACGAGT 
AGTTTTGGTC GTTATGAACG ATGAAATCCA 
GACTAATGTC AGCACCTTCC AGACTCCAAC 
GGAAATCCTC TACTTCAAAA CAGCTGAACC 
AGGTTTAGTC CCTATCATCT CGGCTTATGC 



GGATTTAGAA 
AGAAACGGCT 
ACGATGCTTT 
GCAAAAAGCA 
CCTCATCGCC 
ATACTCTTCG 
CTTCGTCAGT 
CTACAACCTC 
CATGTTTTGA 
CTTCGTCAGT 
CTACAACCTC 
CATGTTTTGA 



CACTTGGACG GTTTGATTAT 
CAAAAATTAG AAAGCCTTCT 
AACGGTATTG CCGAGCCTGT 
GGCGTTTTCT TTGTTAAAGA 
CTCAATGCCG GACTAACAGG 
AAAATCTCTG CAAACCACGT 
TTCATCTACA ACCTCAAAAA 
AAAAACATGT TTTGAGCTGA 
GCTGACTTCG TCAGTTCTAT 
TCTATCTACA ACCTCAAAAA 
AAAAACATGT TTTGAGCTGA 
GCTGACTTCG TTAGTTTCAT 



CCATGAACCA 
ACCTTCCAAG 
AGGAAGCAGA 
CAGCCTATTT 
TGCGTACtCC 
GGCCAGCGAT 
CGCTGCCAAG 
ACATGGCCCC 
TCGTGTTCGC 
TGGTATGACA 
CCAAGCCTTC 
GCAAAAAGGA 
TTATGCATAC 
ACTCAACCCC 
ACAGGCTTTG 
CACGTCGCCT 
CATGTTTTGA 
CTTCGTCAGT 
CTACAACCTC 
CATGTTTTGA 
CTTCGTCAGJc 
CTACAACCTC 



TGTGTCCAAC 
TCCCCATATC 
TAACTACGAT 
CCTTGATACC 
AATGAGCTCG 
GACAGGGCTG 
TATGTCACCA 
CTTGGTCTCA 
TTTGACCTTG 
GATGAGCTGA 
GGAGCTGGTA 
ATTCCAGTCG 
CAGGGTGGGG 
CAAAAAGCTC 
AAAGACTATA 
TACCGTATGT 
GCTGACTTCG 
TCTATCTACA 
AAAAACATGT 
GCTGACTTCG 
TCTATCTACA 
AAAAACATGT 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2231 



(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1310 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
GAGTCAAAGG CTCCGAGGTT GACTTTTTAC AAGGGGACAG GTGAATATTA TCTAGACCTG 60 

TCAGAAATTC TCTTCTTTGA AACAGAAGGG AGCAAGATCT ACGCTCATAA CCAGAAGGAA 120 

GCTTATGAGG TTCGCCTCAA GCTCTATGAG TTGGAGTCTA TCTTGCCTCG CTATTTTAAT 180 

CGAGTTTCCA AGTCAACGAT CGCAAACATC CGTCAGATTT ACTCAGTGGA CAAGTCCTTT 240 

TCAGGAACGG GCACCATTTC CTTTTATCAG ACGCACAAGG AGGTTCATGT CTCACGGCAT 300 

TACCAATCCC TCCTAAAAGA AAATCTAAGA AACATGAGGT AAAAAACATG AAAAAGAAAG 360 

CATTTGGTAT TGTTTTATTG GTTTTAGCAG CTTGGATCTT GCTGCAAGGG AATTTTGGAA 420 

TTCCTTCTTT GGATGGTAAA ATATGGCCTT TACTAGGTAT TGTTTTTTTT GCTTATAAGT 480 

CCATTGAGTC CATCCTTAGA CGTCATCTCA CTTCGGCAGT TTTTACAGGT TTACTGGCGC 540 

TCATCATTGC AAATTACGCT TATGACTTGT TACCAGTTAC CAATCATTCT CTTATTTGGG 600 

CTAGCATCTT GGTGGTACTT GGTGTTGGTT ATCTGACGCA TTCAAGTAAG TTCTGGAATG 660 

AAAAAAAATG GTGGTACAAT GGGAAAAAAA CAGTCGTCAC GGATAAGGAA GTCGCTTTTG 720 

GTAGCGGGAC CTTCTATAAG CAAGATCAAG ATCTCGTAGA TGACCAAGTG 

TTGGGGATGC TAAAATCTAC TATGATAATG CAGAGATGCT AviulvjtAi III GCAACTTTAA 840 

ATATTGAAGT GGCCTTCGGG AATGCAACCG TCTATGTTCC ACAACACTGG CGTGTAGATT 900 

TGAAAGTAGA AACCTCCTTT GGTGCAGCTA AGGCTGACGC TCCTGTAGCC CCAACCAGCA 960 

AAACCTTGAT TATCCGTGGA GATGTGGCTT TTGGGAAGTT GGAAATTGTC TACGTTAAAT 1020 

AAAAAAATCT TCACTTCAAC CATCAAAATA GACGTACTAA GAGTAGGAAA TTGATGCCTT 1080 

GCTCTGATTT CAGTTCTATG GTTGTTAGAC TTTAAAAAAT GAAATGCTGC CTTTAAAAGT 1140 

TGTATATTTT TCGATATTTT GGCTTTTACG TTTGATGTAT CTATGTACTA CAGCGTAGAT 1200 

GATGTAGTGT CAAATGCTTT TAAAAAACGG ATGATATTGG ACAGTTTTTT TGCCTTTAAT 1260 

TGCTCAGGAA CCATGAAAGT CAGTACCTGG GTTTATGACA AGGGAGAATG 1310 


(2) INFORMATION FOR SEQ ID NO: 267: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
ACTCTGATTT GATTGGAACG ACAGTCGGTG CCATTGCAGT TACTTCAAAC GTAACGACTT 60 

ATGTTGAGTC TGCTGCTGGT ATCGGTGCAG GTGGACGTAC TGGTTTGACA GCCTTGGTTG 120 

TAGCTATCTG TTTTCCGATT TCAAGCTTCT TTAGCCCACT TCTAGCGATC GTACCAACAG 180 

CGGCTACAGC TCCAATCITG ATTATCGTTG GGATTATGAT GCTTGGTAGC TTGAAAAATA 240 

TCCATTGGGA TGATATGTCT GAAGCAGTTC CTGCCTTCTT CACATCTATC TTTATGGGAT 300 

TCAGCTACTC TATCACTCAA GGGATTGCAG TTGGTTTCTT GACTTACACT TTGACTAAGC 360 

TTGTTAAAGG TCAAGTTAAA GATGTTCATG TCATGATTTG GATTTTGCAT GCCTTGTTTA 420 

TCCTTAACTA CATCAGCATG GCCTTATAAT AGAATGACCC AGGGGGATTT CCCCCCTTTT 480 

TTAATACAaG GAGATAGGTG ATGAAAGAGA AAAATATGTG GAAAGAATTG TTGAATCGTG 540 

CAGGCTGGAT TTTGGTCTTT TTACTTGCCG TCCTTTTATA TCAGGTTCCC CTAGTGGTTA 600 

CCTCTATTTT GACTTTAAAA GAAGTAGCCC TGCTACAGTC AGGGCTGATA GTTGCTGGCC 660 

TTTCAATTGT GGTTCTGGCT CTATTTATTA TGGGAGCTCG TAAAACCAAG TTAGCTAGTT 720 

TTAATTTTTC TTTTnTAGA GCTAAAGATT TGGCACGTTT GGGCTTGAGT TATCTAGTTA 780 

TTGTCGGGTC AAATATACTT GGTTCCATTT TATTGCAACT GTCAAATGAG ACGACAACAC 840 

CTAACCAGTC TCAGATTAAT GATATGGTTC AAAATAGTTC GTTGATTTCC AGTTTCTTCT 900 

TGCTAGCCTT GCTTGCTCCG ATTTGTGAGG AAATCTTGTG TCGTGGGATT GTTCCTAAAA 960 

AGATTTTCCG AGGCAAGGAG AACTTGGGAT TTGTAGTCGG TACGATTGTG TTTGCTTTAT 1020 

TGCATCAACC AAGTAATTTA CCTTCTTTAT TGATTTATGG AGGTATGTCG ACAGTTCTAT 1080 

CTTGGACAGC CTACAAGACC CAACGTTTGG AAATGTCGAT CTTGCTTCAC ATGATTGTTA 1140 

ATGGGATTGC TTTCTGTTTG TTGGCTCTTG TGGTGATTAT GAGTCGGACA TTAGGAATTT 1200 

CTGTTTAAAA GTTTTTATGT AGGAACCGAC CTCTTTCTAC CAGGGAAAGA TGAATGCAAT 1260 

CGTGTCCATC TTTTTCTTTT TATGGTAAAA TAGAAAAATA ATATGATGAA AATCCTTGAG 1320 

GGAGTGACCG ATATGTCAAG TAAAGCCAAT CATGCAAAGA CAGTTATTTG CGGAATTATC 1380 

AATGTAACCC CAGACTCCTT TTCGGACGGT GGTCAATTTT TTGCTCTTGA GCAGGCGCTC 1440 

CAGCAGGCTC GTAAATTGAT AGCAGAAGGA GCCAGTATGC TAGATATCGG CGGAGAATCG 1500 

ACTCGGCCGG GAAGTAGCTA TGTTGAGATA GAAGAGGAAA TCCAGCGTGT TGTTCCAGTG 1560 

ATCAAAGCGA TTCGCAAGGA AAGTGATGTC CTCATCTCTA TTGATACTTG GAAGAGTCAA 1620 

GTAGCAGAGG CTGCTTTGGC TGCTGGTGCC GATCTAGTCA ATGATATCAC TGGTCTTATG 1680 

GGTGATGAGA AAATGGCTTA TGTGGTAGCT GAAGCGAGAg CGAAAGTGGT CATCATGTTT 1740 

AACCCAGTTA TGGCTCGACC TCAGCATCCT AGTTCGCTTA TCTTCCCTCA TTTTGGTTTT 1800 

CGTCAAACCT TTACAGAAAA AGAGTTAGCT GACTTTGAAA CATTGCCAAT CGAAGACTTG 1860 
ATGGTGGCTT 


TCTTTGAACG 


AGCACTAGCG 


AGAGCGGCAG 


AAGCTGGTAT 




174V 


AATATCCTGT 


TGGATCCAGG 


AATTGGCTTT 


GGTCTGACCA 


AGAAAGAAAA 


i L * V»L 1 1L1 1 


1 Oftft 
IjOU 


TTACGGGACC 


TGGATAAACT 


ACATCAGAAG 


GGCTATCCAA 


TCTTTCTCGG 






AAGCgATTTG TCATCAATAT 


CCTAGAGGAG 


AATGGTTTTG 


AAGTCAATPf* 


1 vwtviAL nvtnu 




CTTGGTTTCC 


GAAATCGGGA 


CACGGCTTCG 


GCTCATGTAA 

**** A, A W A 




1 V»Lv*AV»ALAVj 


2160 


GGTGTAGAAG 


TGGTGCGCGT 


GCATGACGTA 


GCTAGTCACA 




fry* » * a T»fV" /"*/■"• 
IwAAAl 




TCTGCCATTC 


GTCTGGCTGA 


TGAAGCGGAA 


AATTTACATT 

mi A A A A 




1 AAA 1 AAuAT 


2280 


GAAAGAAATT 


GAAAACAATC 


AGTGGATTGC 


TAACTACCGG 


Af flrtATT^A if* 




2340 


CTTGGAACGA 


ATGGTGGAAC 


* \J A A fl\J w A L A 


GCGTGfZCAAT 


i LA 


AAlTlAAGGT 


2400 


CCTCCATATC 


GGAGGGACTA 


ACGGCAAGGG 




v*L 1(111 i\rn 


AaaAvJATGCT 


2460 


AGAAAAGCTA 


GGGTTGAGAG 


1 lUw^VJ 1 V» 1 X 




i a i v. l la rrc 


ATTACACAGA 


2520 


CCAGATTAGC 


ATCAATGGGG 


A. ATffi ATPTf* 


nunnuV.unuu 




TCATGGCAGA 


2580 


CTATCAGTCT 


TTGCTGGAGG 


uaaia/mjlvjo x 


ffipp » ITTTl 
LuLLAAi I in 




CCGAGTTTGA 


2640 


GATTATCACA 


GCCCTGGCCT 


AlAjAL lnV.1 I 


i \A-L rLAl>AV* 


LAAOTAGATG 


TGGCCATCAT 


2700 


GGAAGTTGGC 


ATGGGTGGAC 


111 lUUAiAU 


1 ALUM tu L\. 




TTTTGACAGG 


2760 


AATTACAACT 


ATTGGCTTGG 


n 4 V*A..l VJ 1 \JU\> 




wALALU 1 ll^lj 


AGGTCATAGC 


2820 


AGAGCAGAAG 


GCAGGTATTA 


TCAAACAAGG 


GATGCCCTTG 


GTAACAGGGC 


GTATTGCTCC 


1 A A A 

2880 


AGAAGCCTTG 


GCTGTGATTG 


ACCGCATTGC 


GGAAGGGAAA 


GATGCGCCGA 


GACTTGCCTA 


2940 


CGGGACAGAT 


TATCAGGTTC 


GTCATCAAGA 


AAGTGTGGTG 


ACAGGGGAAG 


TCTTTGACTA 


1 AAA 


TACAAGTGCT 


GTCAGACAAG 


GTCGCTTCCA 


GACTAGCCTG 


CTTGGTTTGT 


ACCAAATAGA 


1 A£ A 

3060 


GAATGCTGGG 


ATGGCCATAG 


CTTTACTTGA 


TACTTTTTGT 


CAAGAAGATG 


GTCGAGAGCT 


1 1 OA 

3120 


AGCAAGCAAT 


GATTTTCTI*G 


GTCAAGCCTT 


GGAAGAAACA 


AGTTGGCCAG 


CGCGTTTGGA 


1 1 OA 

3180 


AATCGTGTCA 


AGAGATCCCT 


TGATGATTTT 


GGATGGAGCC 


CACAATCCCC 


ATGCTATCAA 


3240 


GGCCTTGTTG GTAACCTTGC AAGAACGTTT TGCGGATTAT CATAAGGAAA TCCTCTTCAC 


3300 


TTGTATCAAA 


ACCAAGGCCT 


TGGAGGATAT 


GTTGGACTTG 


CTGGGAGCCA 


TGCCAGTTAC 


3360 


CGAGCTTACT 


CTAACACATT 


TTGCGGATAG 


TCGGGCGACG 


GATGAAAACG 


TGCTGAAAGA 


3420 


GGCAGCTAAG 


TCTAGAAATC 


TCAGCTACCA 


AGATTGGCAT 


GATTTTCTAG 


AGCAGAATTT 


3480 


GACAGATAAA 


AAAGAAGAGA 


AACAAACAGT 


TAGGATTGTC 


ACAGGTTCCT 


TGTATTTCTT 


3540 


GAGCCAAGTG 


AGGGCCTATC 


TGATGGAGAG 


GAAGAACGAG 


AATGGATACA 


CAAAAGATTG 


3600 


AAGCGGCTGT 


AAAAATGATT 


ATCGAGGCTG 


TAGGAGAGGA 


CGCTAATOGC 


GAGGGCTTGC 


3660 
AGGAAACACC TGCTCGTGTA 
CAGCAGAGGA ACATTTGTCA 
AGGATATCTT TTTCCATACC 
ACATTGCCTA CATTCCAGAT 
AAGTTTATTC GAAAAAACCA 
TGATGGACTA TCTACGTGCT 
TGAGTATGCG TCGTGTTAGA 
TATTTGAAAC AGATAAGGAT 
AATCCGCTTC AAGCGGATTT 
ATTTGGAAAT GTTTGCCTAT 
TTGTCGTTTC AGCCATCCTA 




GCCCGTATGT ATCAAGAGAT TTTTTCAGGT CTTGGTCAAA 
AAATCCTTTG AAATTATTGA CGATAATATG GTGGTAGAAA 



ATGTGTGAAC 

GGTCGTGTGG 

CAAATTCAAG 

AAAGGAGCCT 

AAACCAGGCA 

CTCCGTGACC 

TTCTAGAAAG 



ACCACTTCTT 

CAGGCTTGTC 

AACGTTTGAA 

TTGTTGTCAT 

CTGCAACCTT 

AAGCTTATCG 

GAATCATTAT 



GCCATTTTAT 

TAAGCTAGCC 

TATCGAAGTG 

TGAGGCGGAA 

GACGACAGTA 

TTTAATGGGG 

GGATCAACTG 



CCTAGAGCGC 
CGTACGGTTG 
GCCGATGCCT 
CATATGTGTA 
GCTCGTGGTC 
CTATAAAAAG 
CAGATTAAGG 



CAGCCTCTGT CCATTACGGA 
GTGAAGATTT GATTGAAACG 
CTCTTGTCCA AGAAATGAAG 
TAGATACTTG CTCGGTAACC 
GCAATATGGG AGATAAACAA 
GCATCCATAT TCTCAAAGAG 



AGGATAGCTT 
TAGAAACCTT 
CTCGTTTGAT 
TCATATTGCC 
TGCGCCTCAT 
GAAAAAATAG 



TGCCAATCAA 
GTTAGCCATT 
TGATTTGGAC 
TCATCCTTAC 
TTTATCCATC 
AAAAACTCTA 



TCTTCGAAAA TCTCTTCAAA 
GGCTAGCTTC CTAGTTTGCT 
TGGGAGGAGG ATAGTTTCTC 
AGGGTCGAAA GGATGGTTAA 
GTGGAAGGTA ATCTTTCCTT 
AAAGACATTT TTTAAGAAAT 



CATGGTCTTT 
TCCTATGATA 
GAATTGTGTC 
GTAGCCTATA 
TTGGAACTGA 
ATTCATCGCC 
GCAAACTTGA 
TCCAGTGTCT 
GTGGTTGAGG 
GAGTCAGAGC 
TTGCTCTTTG 
ATAGCGGAAC 
CGATATTAAA 
GTTTTCAGTT 
CCACGTCAGC 
CTTTGATTTT 
TACCGTCCAT 
AGTCAAAATC 
GGTTATTAAG 
GGTCGATGAT 



TTCCTAGTGA 
TGACCAAGGC 
AGCAGTGGAC 
AACTGGTGGA 
AAAAACCTTG 
GCAAGCAACG 
AGCAAGCCAT 
TAGCGACGGA 
TGGAAACCTG 
TGGGACGGGr 
TGGAGGACCA 
GCCTTTTTGT 
ACAACCGATC 
ACTTGCAACT 
GTCGCCTTAC 
CATTGAGTAT 
GTCTAAAACC 
AATGGCTGTA 
CAATTGAAAC 
ATACCAAAAA 



GAAAGAATTG 

AGCTACAGAC 

GACTTGGTTT 

ACGTACCTTT 

GGCACCGGTG 

AGCCTTTATC 

TGACAAACTG 

GCCTTGGGGT 

GCTACCAGCA 

GAGAGAAGTG 

GATCCTTTAT 



GGGCAGAAAT 

TTGGATTTAA 

CAGGAAACGA 

GAGTTTTATC 

CATTTGTCAC 

GCCCTAGGAA 

CGAGCTCGTG 

GGAGTGGAGC 

CAAGACTTGT 

CATTGGGGAC 

ACAGACGACC 



CCTTGAGTCt 
CGCAACTTGT 
GAAGGCTAGA 
CGTACTCAAG 
TAAAATAGGT 
AGTACTCTTG 
GGGAGGTGTT 
TCGAGTTCTT 
GAGTCAATGA 



TACAGGAAAT 

ATGATGCTTT 

GTTTTTATAC 

TACAGCTTGC 

CATTTTCTTC 

GGGGATAACG 

GACTTGAAAA 

CTTCCAATTC 

TGTCATCAGG 



3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 
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CAAGCTGGTA ACAATACCAA AACTAGCAGA TCGCATGTGG GTATTGGTAA AAGCCATATC 5460 

TCTGTCCCCT TTCTTTTCCC TTATCATACA GCAAATAGGA TTAAAAATCA AGAAAAGGTG 5520 

ATTTTTTGAA AAGGATTTTA GTTACAGGGA GAAATAGGGA AAAAATTCCT AAAAATCTAC 5580 

CGAAGTTAAT AGGTAAATTC CCAAATTAAC TTGATTATAT AACTTTCAGT TACTTTGAGA 5640 

AGTTACCGAA AAATATTTTT CATATCTATT GACTTTTAGG GGTAAAATTT GGTATGATAG 5700 

TAGGCGGTAT TGTTTACCCC ATTTGAAAGG CCCCGGAACC TTCCAAATAC TTTTCGATGG 5760 

GAAGGAACAC CCATCACCGT AAACAAAAAT CGAACTATAT ATAGGAGAAA TCATGAACAA 5820 

AACAACATTT ATGGCTAAAC CAGGCCAAGT TGAACGTAAA TGGTACGTAG TTGACGCAAC 5880 

TGATGTACCA CTTGGACGTC TTTCTGCAGT AGTTGCTAGC GT 5922 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

TAACTATCTA CGATGAGCTG TTGTGATTCT CATTAGTTCC CCTTTCCCAA GAGGCATAGG 60 

GGTGCGCATA ATAGATGTCC TCCTCAGAAA ATATATCAAA CAAGCGATTG AATTCCGTTC 120 

CATTATCTGC CGTGATGGAA AGAATCTTGT GTTGTTTTAA GATGAGTTTT AGAGCCTGAT 180 

TGACCACCTC AGCACTTTTA TTTGGAATCA ATCGGATGAT CTGATGTCTA CTCTTTCGAT 240 

CCGTCAAGAC AATCAAGCAG TAGTTTTTCG ATCTCGTAAG TAGAACCGTA TCAATCTCAT 300 

AATGCCCATT CTCCAAGCGA AGATTGATAG CTTCAGGCCG CTGTTCGATG GATTGACCAG 360 

CAGGTTTAAA GTTGGTGCTA GCCTGTTTCT TAAGCGCTTT TCCTTTTCTA GGGTAAAGCA 420 

AATCCTGCTT GCTTAACCCC AATTTTCCAT GATGAATCCA ATAGTAAATG GTTGAAATTC 480 

CCACGTTAAC CCCTTTAGCC ATAACCATCA TTTCAGGCGA AAATTTTTGG TTATGATAGT 540 

GGAGAATCTT TTCCTTTAGT TCCTTGGTCA AGCTTGATTT CTTGACCGAG CGCTTGCGAT 600 

TGTTTTCATA AGACTGTTGA GCGTAGTCGG CAGAATAAAC CTCTTTGAAG CGCCCTTTTC 660

CAAGACATTG TCGGACTGTC CCACGCTTGA TTTCAGTGTG ATAGTTTGAG GAGCTTTTCC 720 

AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCTTT CGATTAAGCG 780

ACGGCTATCG ATTGTCAAAT GTTTGGCTTT TGTAGTATAA TTGTCTTGCA TCTCTGTGCC 840 

TTTCTTGTGT TTGTGGTTGA ACAACAAGTA TAACACAGAG GTGCTTTCTT ATGCCTACAA 900 
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GAGCTTTCAT TATTTCCATT TTCTTTTGGA TTTCACTCTA TTCTGAAAAA CTTGTGTATA 960 

TTTACTGAAG CTAGCAAGTC TTACCTGTAA ATTTAATGAA AGCAACACAA AATCCGAGAG 1020 

GGGAATCTCG GATTAATAGA TAGAGAGTTT TTACTTTAAA TAAATTGTTT AAAATATCAA 1080 

CAACATCACT TCTTTTCTTA ACCTGATAAG TCTTGATTCC TAATTTTGGG GCTACGATTA 1140 

TATTGTCCTC AATATCGTCT AGAAAGACAC AATTTCTAGC TTATAACTGG TATTTATCGA 1200 

TAGTTACTCA TATACATCAG TCCACCTCCA TACTTATGTG CGAGCCTCTC TTTGTATTAT 1260 

ACCTCCATAC TCACCTTACA GATTCTTTTG GTAATAATAT CTTTGCCTAA TGTAGAGACA 1320 

GTCTTGCAAA GAAAAAACTT CCTTGTAGCC ATGTTTCTGA TAAAAGTCCG GTGCCTGGAA 1380 

CTGCTAAGTA TTGACAAAGG CAAAACAACA ATTTCGATTC TTAGCTTCAC TTTCTGCCTG 1440 

TTCCAATAGT TTTGAACCGA TTCCTTGCCC TCGCAGTTCC TCTTTTACAA ACAAATACTC 1500 

GATTTCTAGC CAATTTCCAA AAGTCTCTGC TATCAAACCT GCCAGGAGAT TGCCCTTTTC 1560 

ATCTTCGACA TAAAGATTAA GTGGCTCACT TTCAGCCTCT TCTCTTTTTG AACGGTTATA 1620 

AACACGAATC AGATTCCCTA TTTCTTGCGA TTTATGTGAT TCCTTATTTT CCAATCTAAA 1680 

GTATAGTGAA ATGAAATAAA ACATGCGCAA ATCGATTAAG GAATTTAATC TAATTTCTAA 1740 

CAATGTCTTA GAAATCAAAG TGTACTATTT TAACTTCAAT GCACTATACA TCTAATACTC 1800 

AATAAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGGT TGCTCAAAAC ACTGTTTTGA 1860 

GGTTGTAGAT AGAAcTGACG AAGTCAGCTC AAAACATAGT TTTGAGGTTG TAGATGAAAC 1920 

TGACGAAGTC GGCTCAAAAC ATGGTTTTGA GGTTGTAGAT GAAACTGACG AAGTCAGCTC 1980 

AAAACAGG 1988 
(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 709 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

CCGGATATTT GTTTTATGTA ATTTTCTTGC AAGTTTCTTC TTAGTAGCTT GTCAGTCAGG 60 

TTCTAATGGT TCTCAGTCTG CTGTGGATGC TATCAAACAA AAAGGGAAAT TAGTTGTGGC 120 

AACCAGTCCT GACTATGCAC CCTTTGAATT TCAATCATTG GTTGATGGAA AGAACCAGGT 180 

AGTCGGTGCA GACATCGACA TGGCTCAGGC TATCGCTGAT GAACTTGGGG TTAAGTTGGA 240 
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AATCTCAACC ATGAGTTTTG ACAATGTTTT GACCAGTCTT CAAACTCGTA AGGCTGACCT 300 

AGCAGTTGCA GGAATTAGTG CTACTGACGA GAGAAAAGAA GTCTTTGATT TTTCAATCCC 360 

ATACTATGAA AACAAGATTA CTTTCTTGGT TCGTAAGGCT GATGTGGAAA AATACAAGGA 420 

TTTAACTAGC CTAGAAAGTG CTAATATTGC AGCCCAAAAA GGGACTGTTC CACAATCAAT 480 

GGTCAAGGAA CAATTGCCAA AAGTTCAATT AACTTCCCTA ACTAATATGG GTGAAGCAGT 540 

CAATGAATTG CAGGCTGGAA AAATAGATGC TGTTCATATG GATGAGCCTG TTGCACTTAG 600 

TTATGCTGCT AAAAACGCTG GCTTAGCTGT CGCAACTGTC AGCTTGAAGA TGAAGGACGG 660 

CGACGCCAAT CCCGyTCCTC TTAGAAaATA CTGATGATTT GAAAGAAGT 709

(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

TATAAAATCT TAAGTTAAAT GATTTCAAAA TTCAGAAAGG GATTGCTTTA TGCAGTTCCT 60 

TTTTATTTTA ACAGGAGTGA AACTATACTG TTTCTAAATT GTGAATCAAT CAAAACTGAT 120 

TGTGATGGGG CTATTCTAGC TTTAGAAACC TTCAAAAATT AAAATTTAAG GCAATCAATT 180 

ACTTGGAAGA GTATGAAAGC ATTTAGTTTA TAGGAATTCT AGGTCTAGAA TTACATATAT 240 

ATATTTATGA AGACGGGGTG TTCGATAGTT AGTATTGTTC TATTCTGAAA GATTTGAGCT 300 

CTCAGTTGTA TAGAAAGTGT TCGAATTTTT TTAAGTGATT AAATTAGTTA ATTGTATGAG 360 

GTGCTTTATG ATATAATGTT CTTAATGAAT TTTCAGAAAG GAAAACCTCA AATTGTTCTA 420 

CAAATTTCTA CTCTTCGACC TCGACCACAC TCTTCTTGAT TTTGATGCTG CTGAGGATGT 480 

GGCTTTGACC CAACTTCTAA AAGAAGAAGG AGTTGCGGAT ATTCAGGCTT ATAAAGATYA 540 

TTACGTTCCT ATGAACAAGG CTCTCTGGAA AGACTTGGAG CTGAAGAAAA TCAGTAAACA 600 

AGAGCTGGTT AACACGCCCT TTTCTCGTTT ATTTGCTCAT TTTGGACAGG AAAAAGACGG 660 

TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG CAGGGACAAA CACTATCGGG 720 

CGCTCATCAT CTCTTGGACA GCCTCATTGA GCGTGATTAT AACTTGTATG CTGCGACAAA 780 

TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT GGTCTAGCAC CTTATTTCAA 840 

TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG GATGCTCTTT TTTATGAAAA 900 

GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG CTGATGATTG GAGATTCTCT 960

AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 1020

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 1140

TGACTACAAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 1260

TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG 1320

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 1380

TAATCTTAGA GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 1560

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 1620

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATGG CAAGTTTCTA AAGAAACTCC 1680



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271:

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AGAGGATAGA GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 120 

CCTTGATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 180 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 240 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 300 

AATCAATGCT GTAGGTCTGA CTCGCGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 360 

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT 420 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 480

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

TCATAGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTGAT CACTTCCTGT ATATCCGG 598 
(2) INFORMATION FOR SEQ ID NO: 272: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272:

CCAGCAAATC AATAACTGCA ATTGCTATAA AATGGATTCT ATAGAGTTTT TTCATGACAA 60

GACCTCCCTC TTTTATCTAA CTTCATTCTA CTCCAAAAGA ATGGGAGTTA CAACTAAAAT 120

GATAAAAATA GCAGAAGGGA GATTCTCTTA AGTTGGCTAG TATTCTTTAT TTGAGTTTCC 180

TTCTATTATC TAACTTCTTC ATCATTCCAG ACAAATAAAG CTCCGATTGC ATTGAGGATA 240

TAAAAGATGT ATTTACCGAT ATTGGCGAAG TTTCCTTGAA TACCAGCTTT TGTCAGCTGA 300

ACGAAATTGT AAATCAACCA AAAGCCCCAC TGAGTTGTTA [illegible] ATTCAAAGCA 360

TTGGCAATGA GGGACAGTGC AAAGGCAATA GTTGTTACGT AGGCAAGGAG ATTCATCTTG 420

CCCCCATATC CGATATAGTT GGTCACAAAG GCAAAGAGGA AGGCGATGAT GGAAATGATC 480

ATGGCCGCCA ATTTTACCTG TTTTTGGCTC ATTTGGTTGG GTCTGCCTTC TTGCGAAGCT 540

TCCCACTTCT TTATAGCAAA GGTATAAATG AGGAAGGTGA CGGGATAGGT AATGATGGCC 600

GCCTTATTTC CAAGGATATA ATCAATAGCA CCGGACAAAA TGGTATTAAC AATACCAAAG 660

TAATTTCCCC ATTTGCTTAA TTTCCCCGTG AAACGAGTGG ACAACATGGA AATCCCAACG 720

TTGGTTACGG AAATCAATCC AAAGGGTACA AGAGCTGTCC ATGATCCCCA GTCTACAAAT 780

TTATCGAGGT GTGAGTTGAG GTAACCAGAT GCAATCGCAA TCCCAACGAC CAAAGCAACC 840

CCGAAGAGGT CAAACTATTT AGATGTAGCA AAAATTTTTA GTGATTTTTT CATAGGTTAA 900

ACTACCTTTC TTTTTTTCAA ATATTCTCCC ACCAAATGAA AGTAAAATAA AATGATAGAA 960

ATAAAACCCT GAAAATAAAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT 1020

TATGGAGCCT ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA 1080

AACAACTCAT TAGAAAGAT 1099


(2) INFORMATION FOR SEQ ID NO: 273: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

CTGGGATTCA CGTGAAAAGG AAGCCCAGAG AGTAGCCAGG TGTACTGCTA GAACAGTGAG 60

TGAAATTGAA TATTACCATA GAGAGTCAAC CCAGATAGCT CAGGCTTTAG TTGAAAATCA 120

AGCTCGTATC GAGGGAATCT ATAAATACTT TAGCCTTAGC ATGCCAGACT ATTTTTACTG 180

GCAATTAGAG CGGAAAGCTT CGCCTTATAT ATCAGTCTCT CTGTATGAAA ATGTTGATGA 240

CCTCTATGTT CGAAATGATT TTGTAACTGG GGTGGCCATT CCTTTTCAAG ATTACAAGGA 300

AGTCTATCTT TCTACTAAAG ACAAACGTAG GkkAGAAAAA ATCAGGGCTG AGGATTTCAA 360

ACCAGCAGGA AATAGTTTTG CCATTCCAGT GTCAGATCCA GTCTCAGATC AAGACTTAGG 420

AGTGATTTAC ATCTCCTTGG ATCCTGCTGT TTTATACCAT GCCATTGATA ATACTAGAGG 480

TCATACTCCG ATGGCAGTAA CAGTGACCTC ACCTTTTGAT ACGGAGATTT TTCATATGGG 540

TGAGACAGTT GATAAGGAGA GTGAAAATTG GCTAGTTGGC TTAACTTCTC ATGGATATCA 600

GGTTCAGGTG GCAGTTCCTA AAAACTTTGT TTTACAAGGA ACAGTGACTA GCTCTGCTTT 660

GATTGTGGGT TTGAGCCTTC TCTTTATTGT CATTCTTTAT CTGACTTTGA GCCAGACTTT 720

TGCTAATTAC CAAAAGCAGG TAGTGGATTT AGTAGAATCC ATTCAAGTCA TTGCTCAAGG 780

CGAAGAGGGG CGTCGGATTG ACATTTCCGA GAAAGATCAG GAATTACTCC TAATCGCGGA 840

GACGACCAAT GATATGTTGG ATCGATTGGA AAAGAATATC CATGATATTT ACCAGTTAGA 900

GCTTAGTCAA AAAGATGCCA ATATGCGAGC CTTGCAGGCG CAAATCAATC CTCATTTTAT 960

GTATAATACG CTGGAGTTCT TGCGCATGTA TGCAGTTATG CAGAGTCAAG ATGAGTTGGC 1020

AGATATCATT TATGAATTCA GTAGTCTCTT GCGTAACAAT ATTTCCGACG AAAGAGAGAC 1080

CCTCCTCAAA CAGGAATTAG AATTTTGCCG TAAATACAGC TATCTCTGCA TGGTTCGCTA 1140

TCCCAAGTCC ATTGCCTATG GTTTCAAGAT AGATCCAGAG TTAGAGAATA TGAAGATTCC 1200

CAAGTTTACC TTGCAACCGC TGGTAGAAAA CTATTTCGCG CATGGTGTTG ACCACAGGCG 1260

GACAGATAAT GTGATTAGCA TCAAGGCTCT TAAACAGGAT GGTTTTGTGG AAATTTTGGT 1320

GGTCGATAAT GGTAGAGGAA TGTCGGCTGA AAAGTTGGCA AATATCCGAG AAAAATTAAG 1380

TCAGAGATAT TTTGAACACC AAGCCAGCTA CAGTGATCAA AGGCAGTCTA TCGGGATTGT 1440

CAATGTACAC GAGCGTTTTG TGCTCTATTT TGGAGACCGC TATGCCATTA CTATAGAGTC 1500

TGCAGAGCAA GCCGGTGTTC AGTATCGTAT TACAATTCAA GATGAGTAGA AAGGGAGAAA 1560

ATGTATAAAG TATTATTAGT AGATGATGAG TACATGGTGA CAGAAGGTCT GAAGCGTTTG 1620

ATTCCCTTTG ATAAGTGGGA TATGGAGGTC GTCGCAACAG CCAGTCATGC CGATGAAGCT 1680

CTAGAATATG TTCAGGAAAA TCCTGTCGAT GTCATCATTT CCGATGTCAA TATGCCAGAC 1740

AAAACAGGCC TTGATATGAT TCGGGAGATG AAAGAGATCT TACCAGATGC TGCCTATATC 1800

CTGCTCTCAG GTTATCAGGA GTTTGATTAT GTAAAAAGAG CAATGAACCT TAGTGTGGTG 1860 

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920 

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGAGGCT 1980 

GCATTTGTTA GTTATTTAGG GGATAAGGAG AATTGGTGGA TAGGTCTATC CAAGGAAAAA 2040 

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT 2100 

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160 

GAACGCTGGA AGCTGAATGC TGAGAAAACC CTCTTTTACG GTTCTGTAAA TCTGCAGCAG 2220 

TCTGAGAGTC TCTTTGCCTA TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 2280 

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2340 

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 2400 

CATCTCAAAG CTGATGATAT GACGGACATT GTCAAAACCA TTCATGCTAT TCAATCCTTC 2460 

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2520 

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATTACCAAAA AGAGCTTTCC 2580 

CTCAAGGATA TCAGTAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 2640

CGTGAAACCG ATTCGACCTT TGCAGAGTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 2700 

CAGCTCTTGC TTTCAACTAG TGA 2723 

(2) INFORMATION FOR SEQ ID NO: 274:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 836 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT ATATAACTAT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TCCGGCGATA TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCCCTTC GATTGTCAGT ACCTTGAGAG CTGTTTCGAT TGGAAACTGG 360 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420 




TCTAGATTGT TATTGGAAGC AACCGAGTCA GTCGCAATTC CCACTGCTAC TCCCGCTTTT 480 

TGGAGCTGGA TAATTCGAGC AATTCCTGAT GCCACTTTGA GGTTACTGAT ACGATTGTGG 540 

GCGATAGCnA CTTGAGAAGA TGCCAAGCGT TCAATTTCTC TCTCGTTTAA TTCGACCCCG 600 

TGAGCAAATA CGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 

TTGCCGTATC GTTTGAGGAT AATTCCTGAC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 

AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAAGTC TCTACTGCAG 780 

CTATACGGAG AATGAGGTGC TACCATAACC TTGAAATTTG GATTTTTATA TTTTAA 836 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2335 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

ATTTTATTTC ACTTTTTAGG TGGTCTGGGG CTATTCTTAT ATAGCnTCAA GACCATGGGA 60 

GACGGTTTAC AACAAGCTGC TGGAGATCGC CTGGGTTTTT ACATTGACAA ATATACTAGT 120 

AATCCTTTGT TTGGAGTTCT GGTTGGTATT GGGATGACTG CTCTAATTCA GTCTAGTTCT 180 

GGTGTAACAG TTATCACAGT CGGCCTGGTC AGTGCCGGTC TCTTAACCTT ACGTCAGGCT 240 

ATCGGGATTG TCATGGGTGC TAATATTGGG ACAACTGTCA CATCCTTTCT CATCGGTTTT 300 

AAATTAGGTA ACTATGCCCT ACCTATGCTC TTTATCGGTG CCGTCTGTCT TTTTTTTACG 360 

AAAAATCGGA CAGTCAATAA TATCGGACGC ATCCTCTTTG GTGTCGGTGG TATCTTTTTT 420 

GCCCTCAATC TCATGAGCGG CGCAATGGCT CCACTCAAGG ATTTACAGGT CTTTAAGGAC 480 

TATATGATTG AGCTAAGTAA GAATCCTGTT TTGGGTCTCT TTGTCGGTAC TGGCTTGACC 540 

TTGCTAATTC AAGCTTCTTC GGCTACCATT GGGATTTTAC AAAACCTCTA CGCCGGCAAT 600 

CTAATTGATC TACAGGGAGC TTTGCCAGTT CTATTTGGTG ACAATATCGG GACAACCATT 660 

ACAGCCATCA TTGCCTCTTT AGGGGCTAAT ATTGCAGCTA AACGGGTAGC AGGAGCTCAT 720 

GTTGCCTTCA ACGTTATCGG AACAGTTGTC TGCGTTATTT TTCTACTTCC TTTTACTGTC 780 

CTGATTCATT GGTTTGAAGC TACGCTAAAT CTAGCACCGG AAATGACCAT CGCCTTTGCT 840 

CACGGAACCT TTAATATTAC CAACACCATT GTCCAATTTC CATTTATCGG AGCTCTGGCT 900 

TACTTTGTAA CCAAGATTAT TCCTGGAGAG GACGAGGTTG TCAAATACGA ACCCTTATAT 960 
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CTTGATGAAC ATTTCATCAA ACACGCCCCA TCTATCGCTC TAGGAAATGC TAAGAAACAG 1020 
CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA GTACATCATT 1080 
GACTTGGATG AAAAACTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 1140 
GATCAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCG AAGCTCTCAG CCAAAAAGAA 1200 
AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCCTCATT TGGAACGGAT TGGAGACCAC 1260 
ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 1320 

GCCGCCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 1380 

CTGGATAGTG TGGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATGAA 1440 

GCAATCAATA AGATAGAACG TGTTCTCAGA AAAACCCACA TCAAACGCCT CAACAAAGGC 1500 

GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTCACACTA CACTCGTGTA 1560 

TCAGACCACG CTATGAACCT TGCTGAAAAG CTTTTTGCAG AACAAATCTA AGAACCAAGA 1620 

AGCTATCCAT CATAATTGGA TGGCTTTTTA CTTTTTCCTA AGCAAGACTA GGATGAATGA 1680 

AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 1740 

ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 1800 

TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 1860 

AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA GTCTTTTTAC 1920 

CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT TTTTCAGGAA 1980 

CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 2040 

CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 2100 

ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 2160 

TTGTTGGAAG GAGGGCAGCT TGCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 2220 

GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgGkCGAA TTCTTTTTAc 2280 

AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAC 2335 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276:

CGGATTCACT GTTGTTGACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 60
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TTGCTTTTCA TACTAGGCAA GTGACTGAGG CTTCTACTTG GGTACAGCAA GGGAGCTTAA 120 

GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAG ACTTCATCAT TTCGAGAAGT 180 

GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 240 

AGCAAACAAC TTCGTGCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA CAGTGTAGAA 300 

GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TGTAGAAGTT 360 

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCCTGG AGCAATGGTA 420 

TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTCTTT TCGCACGTGG TGCAAAAGCT 480 

GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 540 

AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC CTGATATGAT GGCTCTTGTT 600 

GGACGTCTTG GACGTGTCCT TGGACCACGT AACTTGATGC CAAACCCTAA AACTGGTACT 660

GTAACAATGG ATGTTGGCAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 720 
GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 752
(2) INFORMATION FOR SEQ ID NO: 277:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2643 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277:

GTCAACATTG ATTTCAAGGC TGTTTGCTTT CTATCTCCCC TTTTTCATAA TGTATAATAA 60 

AATGAAATAA TAACAGGACG AATTGATCGG GACAGTCAAA TCGATTTCTA ACAATGTTTT 120 

AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT TTTCGTACAG GTGCTTCAAC 180

CATTTGAACG ATTTCAAATC CTTCTTTTTG GTAAAGATTC TGAGCTCTTT GATTTGCCTC 240 

GAAGACATTT AGAGAAATAC TGTCTATATC TCTATTTTCA AATGCTAAAC TAACAAATTT 300 

CCTTAAAGCC TTGCTACCTA AGCCTTGCTC CTGTTTCTCG GGGTTGATAA AAAATCTCCC 360 

GATATGAAGA TTGCTGTCTT CTAGCCTGAT TTTCTGGATA AATCCCACAA ACTCTTGTTC 420 

ATCAAAGATT GAAAAGACTC CTTCCAAGGC TTGAAGTGTC AGTAGAAAAG GAATCCTTGG 480 

TCCCATCCAT TCTTCTTGAA AGGATTTGCC TAGGGAGTTG GACCACTGGC ATACAAATTG 540 

AGCGTTTTCT GTGCTCACCT TTTCTTCAAA ACGAATTGTC ATCTTTTCCT CACCACCTTA 600 

TCTATGTTTC TCCATTATAC TATTTCTCCC ATTTTTTACG AATAGATAAG TATGATTGAT 660 
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TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA ACATTTACAT 2520 
ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2580
GTTTCCGATA ATTCTGAAAA ATTAGGAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2640
GGG 2643

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:

TGACCAGTGG CAAAATGGCT ATCCAAATGC AGATGTTATT ATCGATGATA TCATCTCAGG 60 
GCAAGCCTAC GTAGCCTTGG AAGAGGGAGA ACTGCTAGCC TATGCTGCTG TGACCAAGAG 120
TCCAGAGGAG CCCTATGAAG CTATTTATGA GGGAAACTGG CAAGCTGGAG AGTCAGAGTA 180

TCTAGTCTTT CACCGTATTG CTGTGGCAGC AGATCTGCAG GGAAaAGGAG TTGCTCAAAC 240 

CTTCTTAGAG GGCTTGATTG AAGGTTTTGA TTATCTTGAT TTTCGCTCAG ATACGCATGC 300 

TGAAAACAAG GTTATGCAAC ATATTTTTGA AAAACTTGGT TTTAAACAAG TCGGTAAGAT 360 

GCCAGTAGAT GGCGAACGCT TGGCCTATCA AAAATTAAAG AAATAATGCA AAAGAAGTAT 420 

GTAAAAATCC TCTACTCCTC ACCAATTGGT ATTCTATCAC TTGTAGCTGA TGACCATTAT 480 

TTGTATGGAA TTTGGGTTCA GGAGCAGAAG CATTTTGAGA GGGGACTAGG AGATGAAACG 540 

ATAGAAGAAG TTGTwAGTCA TCCTATTTTA GACCCACTTA TT 582
(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 554 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

CCCAAGCTAC TAAGAGACTA AAACTTGCTA GAGAAGCAAG AGAAAGTGTG AATCTTTTTA 60 

ATTTCATGAT GAATTTCCTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 120 

CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 180 
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CCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 240 

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAACCTGT 300 

CACATCCCTT TCCTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 360 

AGGCAATTTT TCAGTCAGAT AACCCAACAT CCCTTGATAA TCCTTACGTT TCAAGGATCC 420 

AAAGAGGATT TGAGGTCGAT ACCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480 

AGTCAAGGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TGTGGGCGAA TACGCTCCAA 540 

GCGASCAGCC CAAT 554
(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

CCGGTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATGCTACACT ATCAATATGA 60 

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GAGGCAATTC 120 

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180 

TGGAGAGTTT CTACAAGGTA TCAGCTGAGA AAGCGGCGGA AGAATTTCAA AATATCCAAG 240 

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300 

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 360 

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420 

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAGG 480 

CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT 540 

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGAGG 600 

ATAGAGGCGG TCAGCTGAAG ATTCACTCAA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660 

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720 

CTGGATTTGT TTACCGAGAA GATTTGTCAC AACCACAGGG GGATGG 766 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

CCGGCCACCG TTCCATCCAA CTTCACAGGT GTGCACTTGA TTGTGTATGT AATTGTCACT 60 

AACGGTAGAA TTTCACCTAT CCCTCCTATC TGCTCGCACT ACCCGCAGAC TTTCTGAAAG 120 

AAGAAGATAA CCTACTTATC CGTTGCTATG ATTATACTAA AGTTTCTACT TTTTTGCAAA 180 

TAGATTTTTA AATTTTTGGC TAATTGTCTG AATCAGGGTC GGAAGTTTGA CGACCTTGTC 240 

ATTGCCTAGT TTTTCGCGTG CAATTTTGAG AATGGCACCT GAGTCTTTTG AAGCAAAGAG 300 

GAATTTTCCT TTGTCTGTAA AGACTTCGAA GTGGCGGCTG ATTTTGCGTC CAGTGACATT 360 

GGCTCCAATC TGATTGATAT GGCTCCAAGG AATCTGGATA AATTGTTCGA CATTGACATC 420 

TGGGTAAAAT TCCAAAGCCT GATCTCCGAC AAGGAATTTC CCAACTTTCC CAGCGATAGA 480 

GAGGTAGGAA GTGCCTGTCG TACTGAGGAG TACTGTTTTG TTAAGTGATT GGGCCATGCT 540 

TAGTCTTCCT TACTTTCTCC AAAAAAGGCA TTGTAGAGGG CTTTAATTGC TGCTTTCTCT 600 

TGGTCTTTAT TGACAACAAA CATAATAGAA ACTTCACTAG AACCTTGAGA CATCATCTGG 660 

ATGTTGATTT TGTTTTCAGA TAGAGCGCGT GTCGCACTAC CAGTCACTCC GATATGGCTC 720 

TTCATTTTTT CACCAACAAT CATAATGATA GAAAGGTCGT GTTCGATTTC TGCATGATCT 780 

ACTTTAGCCT TTTGAACCAA CTGACGCAGG ATTTCTTCTT CCTTGATGGG AGTTAGTTGG 840 

CGAGAACGGA GAATGATAGA AAGAwCGTCG ATACCTGTTG GCATATGTTC CCAACCGATG 900 



901 



(2) INFORMATION FOR SEQ ID NO: 282:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1765 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282:

CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGGATTT CCTAACAAAT CCACCGCTTG 60 

CTGCATTCGA CCCAAACCTG ATCGAAAATT CAAACCAATC CGACTATGGA GCCATTCTTC 120 

TACTTCAAAC ATACACATCT CCTTGACAAA AGTCCAATCA ATTATCGCAT TAAAGTATGG 180 

TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 240 

TAGCTACTAT GAATGTGAAT ATGGGCTAAA AACATCCACT GGACGTTCCA ACTCTTCCCC 300 
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ATTTCTGGGA GTTGGGGTAA AAATCTTCAC TGGACGTTCC AACTCTTCCC CATTTCTGGC 360 

AGTTGGGCTG ATACAGTCTC CCACACTGTA TCACTCCTCC ATAAAGCTGT TGAAGACTTC 420 

TTCAATCATG TTCCATTCCT CTTCTGAGTC TTCTGGGATT GGTTGCAATT CCCCTTCTGT 480 

TCCATCTTCG TTTTCGATGA ATGAGTAAGC TTGGATTTCA ACTTGTCCGT CTTCGTCTTC 540 

TTCTGCGTTA ACTGGTACTA GAAGAACATA GTTTTTACCA AATTCTTCTT TTCCATCAAT 600 

TGTCAAAAGG ATTTCAAACA AGGTTTCATT TCCTTGCTCA TCTACTAGTG TGATTAGTTC 660 

ACGTTCTTCG TGGTCGTGGT TATGATCGTG TGACATAGCC TCGCCTTTAT ATTAAAATTT 720 

TCTATCTAAA TAATTTTCTA AAATCAGCTG AGCTGCTAAC TTATCAATGA CTTTCTTGCG 780 

CTTATTGCGA CTGATATCTG CTTGTTCAAT CAACATGCGC TCAGCAGCCA CTGTTGTCAA 840 

GCGTTCATCC TGATAGTCTA CTGGTAAACC AAAAAACTCT TCTAGCTTTG CTCCGTAGCT 900 

TGACTAGCTT CTACGCGCGG TCCACTTGTA TTGTTCATGT TTTTAGGCAA GCCCACTACA 960 

AATCGTTCCA CCTTGTAAGT ATCAACCAAT TCCTTAACGC GGTCAAAACC AAATTGGCCT 1020 

TGTTCTTCAT TTATCTGGAT GATTTCAAGC CCTTGAGCTG TAAAACCAAG CGGATCGCTA 1080 

ATCGCCACCC CTACCGTTTT TGAACCGACG TCCAATCCCA TAATTCTCAT AGGTTATAGA 1140 

TCGACTCCTT GTCCTTTGAG GTAGTAGCGA ACCAATTCCT CAACGATTTC ATCACGCTCA 1200 

TACTTACGGA TTTGATTTCG TGCATTATTA TAACGAGGAA CGTAGGCAGG GTCTCCACTC 1260 

AATACGTAAC CTACGATTTG GTTAATTGGG TTGTAaCCCT TATCGTTCAA CGAAGCATAA 1320 

ACATCTGTCA AAGTTTCGCT AATTTCTTTT TTATTGGAAT CGTCCAATTT AAAACGTACT 1380 

GTTTCTTCAG TAAATCCCAT TCTAACACCC TCTTTCCTTA GAATAGTACC ATTATAGCAT 1440 

AATTCCTTAC CTTCTACAAT TCAGGCAGTC TATTTATTTG GATTTTCTAT TGTTCTGTCG 1500 

CGCCATTTGC CAATCTATCT GAAATATATT TGCTTGGTTC ATTTTTCAAA AGATTTTCCA 1560 

AACCAATATT CTTCAGATGT TCCAACTGGG AAGCCTTCTT GACATCCAGA ACTTGAAAAT 1620 

CAAAACTAGT CGTTGTTTGA AGTTCCGTTG CGCTCAATAG TTTTGTTTCA AGTTTGAAAC 1680 

CTGCCAATTT ACGAGCTTCA ATGATAGACT TATCCTTCTC CTCCGCTTCA AGAAGAGCTT 1740 

TTTGAGTTTC CTCCACTCCA TGTTG 1765 
(2) INFORMATION FOR SEQ ID NO: 283:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1346 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:
CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT ATACCACATT 60
TGCCCCTAGA TGTGAACGAG AGAAACGCTC TAGACATTGC CAAGAAGGAA AAAAAAGGGT 120
ACAATGTAAC AAAATCAAGG GAGCTCTGGA ATGAAGAAAC AAAGCAAGTA CAAAGAGGTC 180
GTTTCCTATC TGAAAAATGG TATCGAGTCT GGACGATTTC CGACGGGTAG TCGCCTGCCT 240
TCTATCCGTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACGAGCCCTG 300
CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATCTATTA 360
GAACAAGGGC AACATCAAGA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 420
GACGATTTCC GACTCTGTGT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 480
TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTCTTT 540
GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGGAAC CCAACAAGCC 600
TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT GGTGGAACAG 660
CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACCATT 720
GAACGAGGCA TTGATGGGAT TGACTTGGAG GAGCTCGAAG GCCACTTCAA AACAGGAAAA 780
ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 840
CAAGACAAAC GATCTATTCT TAACTTACCT GCCAAGTATG ATGTCTATAT CGTAGAGGAC 900
GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACACAG 960
GACCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCCT TCGTATTACA 1020
GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTGG CCTACAAAAA TATCCTAGAC 1080
TACGACAGCA ACCTCATTAT GCAAAAGGCC CTGTCACTCT ATATTGACAG TCAATTGTTT 1140
GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 1200
AGGATAACTA AAACACCTTG TCCCCTTCCT CATTATTCCC TACACGATCG yTTATTGCTA 1260
GACCTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 1320
TTTGAAGAGG CCTATTTAAG CACCTG 1346
(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

CTATATTCAG AATATGCCAA AAATTCGGAA TGGTATAAAT TTGCGGAGGG TTCATTTGAC 60 

ATATTTAGAA AACTCCCCCA AAGAATTAAT TTTAAGAAAG ATTTTTCTAG AATTTTGGCC 120 

CCCTTTATTA TTAATTTGCT TAAATTAATC AATAATTATC TAGAGAATAA AGAATACGAG 180 

TGGATTGACA AGAATGGAAA TATTTTTTCC TCTCTACTAT TTTATTTAGA AGATTTAATC 240 

TATCCTTCGA TTGTTAAACC TTTCGTTTTA GAGATAAATT CATTGCGTGA AAAAGGTTTA 300 

CTTGAAGGGG AATCGGAGCA GCAACGGTAC AAATATTTTA TAACATTGTT TGACAAGGAA 360 

GAGAATATAT TAAATTTTTA TAACAAATAT CCCGTTTTAC TGAGGCAAAT ATCGGACTCT 420 

TGTCTTCGGT TCTATACTTA TTTTATAGAA ATTTTATCAA ATTTAGAAAA TGATTTTAGT 480 

GTGCTAGAAG AAGAATTAGG GCTAAGGGGG AAATTAAATG ATATAAAATT TGGAAAGGGT 540

GATACACACA GCCAAGGAAA AACTGTTTTG ATACTCTTCT TTGATGACGC GAAAATTGTT 600 

TACAAGCCTA AAAATTTAAT AATCAATAAC TCACTAAATA CTATTGCTGA GTATATCCGA 660 

AAGGTTGATG AAAAAATTAG GATAAGAATA CCTCGAACTA TTGCTTATTC GGATCACAGC 720 

TATGAAGAAT TTATTGATTA TCTACCTCTA GAGCAAAAGA AAAATTTACC TGAATATTAT 780 

TATAATTTTG GTGTGCTTTT AGCATTTATA TATTTATTTA ATGGGAGTGA TATACATTTT 840 

GAAAATTTAA TTTCCTATGG AGATATGCCT CTAATAATAG ACTTTGAAAC AATGTTACGG 900 
(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:

TTATTTAGCA GAGGCAGTTT TAAATGTGAA GGATTTGGTC AGTCAAACAG TTTTTTATCA 60 

GCAGATTATT GGTTTAGAAA TCCTATCTCA AACGGATACA GAGGTCGTTC TGGGACTTGG 120 

AGGAAAAGCC TTGGTACACT TGATTCAAGC ACAAGAGGGT CGAGAAGTAA GGGAACATTA 180 

TGGTCTTTAC CATCTGGCTA TTCTTTTGCC GACACGAAAG GCTTTGGCGG ATGTCTTGAA 240 

GCACCTGACG GATTTACAGA TTCCTCTTGT TGGCGGTGCA GATCACGGTT ACAGTGAGGC 300 

CCTTTACTTA GAGGACTTGG AGGGAAATGG CATTGAACTC TATCGAGATA AGCCAGTTTC 360 

CACATGGGAT ATTCGAGAAG ATGGACGTAT TATCGGGGTG ACTGAAGTCC TTGCGGCTCA 420 

GGATATCTAT GAGTTGGGGG AAAGAGTAGA GCCTTTTATC CTAGCAGAGG GTACGAGAAT 480 
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CGGGCATATT CATCTTTCTG TCAAGGATAC TCGAAAGTCC AGACAGTTTT ATCAAACGGT 540 

GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TACTGCTACT TCGATCGCAG CTGGGGACTA 600 

CCATCATCAT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTCGATCCGC GTAAACAAGT 660 

CCTACCAGGT TTAGCCTACT ATGTCATCGA AGTCGCACAT AAAGAAGAAC TGTTAACGAT 720 

TGCCCAACGA GCACAAGAAG TTGACGCACC AATCAAATGG ATGACATCGA TCCAATTGGA 780 

AATCACAGAC TCAGATGGCA TCGTGACCCG TATTCGTTTA GCTAGATAGA TGCTATGTGA 840 
TGAAGGTAGA GCATCAATTG TA 862
(2) INFORMATION FOR SEQ ID NO: 286:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:

TCGTTTACAA GATCGCTAAA ATGCATCTCA TGATCGCGAC CACGAATTCC AAGATAGCAC 60

GCGCTACCTC AATCATAGAT AGTTCACTTT TTTCTTGCCC AGCAAATACT TCTAATTCCA 120

AAGCGTTTCT CCTCATTTAT ACTACTATCG CCAGAGCGAA CAGACTCTGA CCTCATTTTA 180

TCATTTACTC TTTATTTTAC GATAATTTTG CGGAATAGTC AAAGGTTAAG GGGGAGAAAG 240

TGGCAGGATT AGACTAATTC CAATATAAAA CTCATTCCTT TTTCTGTTGC TCCATTTTCC 300

ACAAATCCAA GCGACTTGAA ACACCTCCTA GAAGCATGAT TGTAGGTGTA GATTTTCTTG 360

ACTCTCAATT CTTTCCATCC TTTTACTCGA GCCAATTCAA TCAAAGCACT TAGAATCTTT 420

TTTCCAAGTC CTCGATGTTG GTAAGCGGAA TTCCCAATCA CAATGGGGAG ATTATCCTGA 480

GATAGTGTAA TATCCCCAAT TGGAAACCAT TCTCCCTTCT CCTTGACTTC AATCCAAAAA 540

AGCTCACCAT GCCGATyCAr ATAGGAATAC ATGGCTTCCA AGGTCGcTtG ACTGTAAGGA 600

AGCTTCACCC CATCTACGAG GtAAcCAAGT TCACATCCGT GATACCAAGC 650

(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1119 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

GATAGCAATC CGCTTCAGAA ACTTCTCCCT TACCTCTAAC TCCGATCGCT ACTTTGGGAG 60 

AAGATACTTC CATTCTCATA CTATCTGTTG GCTTTGCAGG CTCTAAAAAC AACTTTTCTC 120 

TTGCTACTTC CTGAAAATCT GAATCTTGCA GTTCTTTGCT TTCAAAATAG TCCTGTACTC 180 

GCTCCACATC AAAATTCCCA GCTAAAGACA GAGACATGTT TACAGGTTTG TAAAACTTTG 240 

TAAAATTTTC TTGCAAATTA GTTAGATTCA TTTGGGAAAT GGACTCCTCA CTTCCAACTA 300 

TATCAGTTGC TAAAGGTGTA CCAGGATACA AATTOGCTAA ACTTGAAAAG AATAAACACG 360 

AATCTGGATC ATCTTGGTAC ATTTCTCGTT CTTCCTGAAT AATATCCTGC TCTGTCAGAA 420 

TGGAAGCrTC AGTAAAGTCT GCTGATGTTA CCAATTCATC AAGTAAATCT AAATTTTCTA 480 

AAAAATAATC CGTTGCTGAA AAAAGATAGT TTGTTTTTGT AAAGCTTGTA AAGGCATTAC 540 

TATCTGCACC TAGACTCGTA AAAGCCGACA TCAAATCACT AGAATCTTCT CTCTCAAATA 600 

ATTTATGTTC AAGAAAATGA GCAATTCCTC CAGGATATTG TTTTACATCT CCGTCAACTT 660 

CTGTGACAAA CGTATCTACC GAACCAAACT GTACAGTGAC ACTCCCGTAA ACCTCTTTAA 720 

ATTCCTTTTT AGGCAAAAGA GCAACTGTCA ATCCGTTGGC CAAACGAGTT CGATAAACCA 780 

TTTCTTTTAC AGCTGGATAG TATTTTTCTT CAAAAACAAC CTTTGTCATT CTATTCCTTC 840 

CATAAAGTAA ATCGCTTGTA GTTTCACATT ATTAGCTACT CTACAAATAG CATCTTTGTC 900 

AATTTGTTCA AGCTTTGCAA TCCAACTTTT AAAGTCTGCT GAAGATTTTC CAAATAAGGC 960 

ATTTTGATAA GCACGTTCAA TCAATGAAGA ATGATTATCT TGAGAAAGTA ACAACGACCA 1020 

ACGAATCATT TCCTTGGTCT GATTTAACTC AAACTCTGTA AAAAAACCTT TTTTTAAATC 1080 

AAGCCGTTCA TTATTCATCA ATTTACGAGC CTGGTTACG 1119 
(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGGCCGCCG AGGAGTGGGG 60 

GGTCCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCCGGACAA 180 

GGCAGACGGA CTCCAAAGGA GTTCTTCCAT CTGCAAGTGC GCCTGCGTTA TGTGATCCCG 240 
GTCTTTTGCA TGTGTGTGGC atgaatgctg ttcccaatcc cactccagaa cattctcctc 300

AAAAGTCCGC AACGTCGCCC TGAATGAATC CTGCCTTGTA GTCGTGACCA TTCCTATGAA 360

GGGTCGCAGA GGATTTTCCC CGAGTGCAAG CGCATCCTCC CGCTCAAATC GGGTGCATTT 420

CACAGTCCCG CTCAACGCTA GCCCGATCCC TTTTTGGCAT GCTGACTCAA GCGTCCTTTC 480

AAACAAAAGC TCCTCATCCG CTCCAACCGG CCCGACGTAG ACGCGTAGAC CGAAGTCGTC 540
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1949 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

AAAGAATTCG ACCAATTCAA GGTTGAGGCA TCGCAAACTA TGGACTGTTC CCCCGTCAGT 60

TCTGGACAGA AAACGGGATA AGGTTGGCTG TGAAGCAAGC TGCCCTCCTA CCAACAATTT 120

TGGAAAGTAG GCATCAGCTG ACAATTCTTT ACAAGCATAG TCCGTTCCAT AACCTGTTAA 180

CAGTTGAAAG AGGAACTGGA CAAGGATATC TGAATCCGAA TAACGACAGT AGCGGCGTTG 240

GTCATTCGTT ACTAAATACT TAGAAATCCG CTCTTTTAGT TTCAACTGGG AAAAAAGTTC 300 

CTGAAAAAAG ATAAGACCAC CATACTGGGT TAAATGACCT CCATCGAAAG ATAGTTGGTA 360 

AAAAGACTTG TTTTGGAAGT GATGATTTGG TAAACTGTTC ATGTGAGTTT CCTTTCTTTT 420 

TGTGTTTTTT TCTACACTTA TACCATAAAG GGGAAACTCT TTTTTCTCTA GTAAAAAACA 480 

CCCATTGGGT GAAAAAAGAA ACCATCCAGG ATCTAAGCTA AGGCAAGGAT TCTGGATGCT 540 

TTTTAGATTT GGGGTGAATA ATTGGGGTTT TACAATATCA ACTCCCATGA TAGTCATGAG 600 

ATGACTCTTC ACGAATTGAC GTGATGACTG TCCTTCCTTT TGCATAATTA CCTCCGAAAC 660 

ACAAAAAAAG GGGTAGACAA TCTAGTGTCT ACCCCCGAAA GTTTATTAAA ACAAAAATCC 720 

TGCCAAAGAA TTTTTGGCAG GAAACCAAAT CAATTTATCA GTTTCTATCA ATCGCTTATC 780 

GCTCTCAAAG ACTGGTAAAT AGGGATTCCG CAATCAAATT GCGATACTCT ATTATTTAAG 840 

AGTAACTGAA GCTCCAGCTT CTTCCAATTT AGCTTTGATT TCTTCAGCTT CTGCAGTTGC 900 

AACGCCTTCT TTAACAAGTG CTGGTGCACC GTCAACAAGT TCTTTAGCTT CTTTAAGACC 960 

AAGACCAGTG ATTTCACGTA CAACTTTGAT AACGCCAACT TTTTTGTCGC CTGCAGATGT 1020 

CAATTCAACG TCGAATGAAT CTTTAGCAGC ACCAGCATCA GCTGCATCAG CTGCAGCAAC 1080 
AGCTACAGGA GCAGCTGCAG TTACACCAAA TTCTTCTTCG ATAGCTTTTA CAAGGTCGTT 1140

CAATTCAAGC ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 1200

TGTTATTTCC TCCAAATAAG TTTTAAATTT TATAATACTT TTTTTCGTAG CTAGJcsTACG 1260

CTGTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TGACTGCAAG 1320

AGCAACGTTG CGCACTGGCG CTTGAAGTAC AGAAAGGAGC ATAGAAAGAA GTCCTTCGCG 1380

GTTTCGAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACAGCGC CTTCGATTGC 1440

ACCACCTTTA ATTTCAAGTG CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TTTTCGCTGG 1500

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGGT CCAACAAATA CAGATGCAAG 1560

ATCTTCAAGA CCAGCTTTTT CAGCTGCACG ACGCAAGATT CAGTTTTTAA TAACTTTATA 1620

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 1680

ACGAGCGTCT ACAACGACGA TAGATGCAGC AGCTTTCATT TTTTCAGCTA tACGTCAACT 1740

AGTTCCGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800

TTGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA 1860

CGCTTCTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 1920

ACTGTCTTAG GCAGTTTTTT TAGATACGG 1949



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTCGGTG.\A 120 

CACGGCACTC GTGTGAGCAT CCACGAAGGT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA GTTGGGTTGA CATGGTTGAC ATCAACGGCA ATACGCTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 360 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 480

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACTCACGGAA GTTGCGAATA TCTGGGCGAA 540 
TAGCPCCACG GAAATGTCCT AGGTCGTACT CATAATCGTT ACGTGTGTCA AGGACAACGG 600

TATCTTTATC AAGAAGCGCT TCTTTGAACT CTTTTGGAGA CAAGTAAGCA CCTGTTGTTT 660

CAAGTGGGTT GATGTCATTG TCAAAGTCGT TGTCTTCCAA ACCAAGGTGG ACAATTTCTT 720

TCTTGTACCG AACAAACATC TTCTTGAACG CTTGTTCATT TTCTTCGTCA ATCTTGAACC 780

AGAGTTCTTC CATTCCTGGA AGGCTGTGAA CGTAgTCCAT GTATTTTTGA GTTGTTTCAT 840

AGTCACCTGA AACTGTTCCG TTAATTCCCT CGTCAGCGAC TAGGATACCG CCTTTAAGGn 900

CGATTGATTT ACAGAAAGCC AAGTGGTCTG CAGCAAATTG CTCTGCATTT TCAATTGGAG 960

TATAAAGGTA GTAAAGTAAG ACACGAATAT CTTTTGkCaw AAGATTTGTA TCTCTTTATC 1020

TAT 1023

(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3831 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

ACTATGAACA AGACCCAGAA AAAGTAGCCT TATTTCTTAA GAATTTTAAT AGTTTAAAGC 60

ACCTAGCACC TGTTTAGATT GACGAAACAG GATTCGATAC TTATTTTTAT CGAGAATATG 120

GTCGCTCATT AAAAGGTCAA TTAATAAGAG GCAAAGTATC TGGAAGAAGA TATCAGAGGA 180

TTTCTTTGGT TGCAGGTCTA ACAAATGGTG AATTAATCGC TCCAATGACT TACGAAGAGA 240

CCATGACGAG CGACTTTTTT GAAGCTTGGT TTCAGAATTT TCTCTTACCA ACATTAAACA 300

CACCATCGGT TATTATTATG GATAATGTAA GATTCCATAG AATGGGGAAG CTAGAACTTT 360

TATGCGAAGA GTTTGGGCAT AAACTTTTAC CTCTTCCTCC CTACTCGCCT GAGTACAATC 420

CTATTGAGAA AACATGGGCT CATATCAAAA AGCACCTCAA AAAGGTATTA CCAAGTTGCA 480

ATACCTTTTA CGAGGCTTTT TTATCCTGCT CTTGTTTCAA TTGACTATAT TAGAGGCGAG 540

ACATTTTTCG GTTCTTTGTC AACTGTAGTG GGTTGAAGAA AGCGAAGATC TAGAAAGGAC 600

AAATTTCGTC CH-lVmW TGAAGTTTTC AAAGTTCCTA AAACCAAAGG CATTGTGCTT 660

GATAAGTTTG ATGAGATTAT TGGTGGCTTC CAGTTTGGCG TTGGAATAAG GTAATTGAAG 720

GGCCTTGACG ATTTTCTCTT TATCTTTGAG GAAGGTTTTA AACAAAGTCT GAAACAGAGG 780

TGGAAAAGCA AGAGCTGATA GAGATTATAG TGGTGTTTAA AGTCTTCGGA ATAGCTCAAA 840




AGTTTATCTA GAATTTCTTT ATTAGTCAAG TGCATACGAA AAGTAGGGCG ATAAAATCGT 900 

TTATCACTCA GTTTCTGACT ATCTTGTTGA ATGAGCTTCC AGTAGCGCTT GATAGCCTTG 960 

TATTCATGGG ATTTCGGATG ATGGCTTGTG TTCTGCTCTC AAGAACAGTT ATGATATTGA 1020 

GTTTATCAAA GTCCTGAGCA ATAAAGCTCA TCTCCATCTC CCGATTGAAA CAGTCACTCC 1080 

CCGGACTGTT TCAACsTCCT AGGACATAAT CTCAGGAAGA 66CGAAAAAT CATGCTCAAA 1140 

GTGAAAATCA TTGTTCTTGC GAATGACAGT TGAAGTTGAA ATAGACAACT GATGATCAAT 1200 

GTCGGTCATA GAAGTCTTTT TAATTAGCTT CTGAGCAATC TTTTGGTTGA TGATACAAGG 1260 

AATTTGATGA TTCTTCTTGA CGATAGAAGT CTCAGCGAGC TCCATTTTTG AGCAATGATA 1320 

GCACTTAAAA CGGCCTTTTC TAAGAAGAAT TCTAGTTTGA ATTTTTTTAT ACTAGAAAAT 1380 

CAGAACCATA ATACCTATAT AAAAATATTA TAGTTCTAAT AGGATTTACC CAAAAGTTTT 1440 

AAGGCGGTCT TTTTAGAACT TTAATTGTTT GAAATTTAGG TAGCAAATTT GTTTCTATTT 1500 

TGTCAACTTT TCCTATTTTT ATCTTGTTGA GGCTGGTATT TTAACAATTC AGGAATTGAT 1560 

AGTGAATGTG TAAAATTTTT TGTTAGAATA AGTTTATAAA AAAGAAAAGG AGTATTTGAT 1620 

TATGTTACAA AAAATTTATG AGCAGATGGC TAATTTCTAT GATAGTATTG AAGAAGAGTA 1680 

TGGTCCTACA TTTGGTGATA ATTTTGACTG GGAACATGTT CATTTTAAAT TTTTAATTTA 1740 

TTATTTAGTG AGATATGGCA TTGGTTGTCG TAAGGATTTT ATTGTTTACC ATTATCGTGT 1800 

TGCTTATCGT TTGTATCTTG AAAAATTGGT AATGAATCGG GGTTTTATTT CTTGTTGAGG 1860

TAATTTTAGT AAATTTCCGA ACTAATTTAC TCTTTTATGG AAAGATGATA CTAAATAGCT 1920 

AGTAATTTTT CTAAATCATT TTTTAATAGT TCGAAATAGC AAATCTTTCT ATTGTTTCTT 1980 

CTTGATAAAA AGGCGATTTT TTATTATAAT AAATTGTAAG ATATAATTGC AGGTGAGAGT 2040 

CCTGCCATGT ATGTGAGAAA GGAAGAGCCT GATGGCTCAG ACAAGATTAT GACTTCAGTT 2100 

GTTGTTGTAG GTACCCAATG GGGTGATGAA GGTAAAGGGA AGATTACAGA CTTCCTTTCA 2160 

GCGAATGCAG AAGTGATTGC ACGTTACCAA GGTGGTGATA ATGCTGGTCA CACGATTGTG 2220 

ATTGACGGTA AGAAATTTAA GTTGCACTTG ATTCCATCTG GGATTTTCTT CCCTGAAAAA 2280

ATATCTGTCA TTGGGAATGG TATGGTTGTA AATCCTAAAT CTCTTGTAAA AGAGTTGAGC 2340 

TATCTTCATG AGGAAGGTGT AACAACTGAT AACTTGCGTA TTTCTGATCG TGCGCATGTT 2400 

ATTTTGCCTT ATCATATCGA GTTGGATCGC TTGCAAGAAG AAGCTAAGGG CGACAATAAG 2460 

ATTGGTACGA CAATTAAGGG AATTGGTCCA GCTTATATGG ACAAGGCTGC TCGTGTTGGA 2520 

ATTCGTATTG CAGATCTTTT AGATAAAGAT ATTTTCCGTG AGCGTTTAGA ACGTAACCTT 2580 

GCTGAAAAGA ATCGTCTTTT TGAAAAATTG TATGACAGTA AAGCGATTGT TTTCGATGAT 2640 
ATTTTTGAAC AATATTACGA ATATGGTCAA CAAATCAAGA AATACGTGAT AGATACATCT 2700

GTTATCTTGA ATGATGCGCT TGATAATGGC AAACGTGTGC TTTTTGAAGG TGCACAAGGT 2760 

GTTATGCTAG ATATCGACCA AGGTACTTAT CCATTTGTTA CGTCATCAAA CCCTGTAGCT 2820 

GGTGGTGTGA CAATTGGTTC TGGTGTCGGT CCAAGCAAGA TTGACAAGGT TGTAGGTGTA 2880 

TGTAAAGCTT ATACGAGTCG TGTAGGAGAT GGTCCTTTCC CAACTGAGTT GTTTGATGAA 2940 

GTGGGAGAAC GTATCCGTGA AGTGGGTCAT GAATATGGTA CAACAACTGG TCGTCCACGT 3000 

CGTGTAGGTT GGTTTGACTC AGTTGTGATG CGTCATAGCC GTCGTGTTTC TGGTATTACT 3060 

AACCTTTCTT TGAACTCTAT TGATGTTTTG AGCGGTTTGG ATACTGTGAA AATCTGTGTC 3120 

GCCTATGATC TTGACGGTCA ACGTATTGAC TACTATCCAG CTAGTCTTGA ACAATTGAAA 3180 

CGTTGCAAGC CTATCTATGA AGAGTTGCCA GCTTGGTCAG AAGATATTAC CGGAGTTCGC 3240 

AATTTGGAAG ATCTTCCTGA GAATGCGCGT AACTATGTTC GTCGTGTGAG TGAATTGGTT 3300 

GGCGTTCGTA TTTCTACTTT CTCAGTAGGT CCTGGTCGTG AACAAACAAA TATTTTAGAA 3360 

AGTGTTTGGT CCTAAGAGAT TTTTAAGATT TGTTTAAGAT AGGTCGGGTA TACTATAGAC 3420 

GGTTACAAGA AGACCTCCTA ACTTGTTGTA ACAAATATCC TAAACTTTTC TTTTTCATAA 3480 

TAATCTCCCT ATAGAGTCAC CGCATTCGGT GGCTTTTTTT GTGTTGGGAT TCATGATATA 3540 

ATAATAAAAT CGATAAGTAG GAAAAGAGAA AAGAGATGTA TTATACGCTT GAAGAAAAAC 3600 

AAGTCTTTAT GAGGGAGGCT TTGAGAGAGG CTGAGATTGC TCTTGAACAC GATGAAATTC 3660 

CAATTGGTTG TGTGATTGTC AAAGATGGGG AAATCATTGG TCGTGGGCAT AATGCGCGTG 3720 

AGGAATTACA GCGAGCGGTT ATGCATGCGG AAATTATGGC TATAGAGGAT GCGAACTTGA 3780 

GTGAGGAGAG TGCGCTTGCT GGATTGCACA CTTTTTGTGA CCATTGAACC G 3831 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1441 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

CCGCTGTTCC AACCGCAACA TACCATAGTC CGTACGGGAT TCGAACCCGT GTTACCGCCG 60 

TGAAAAGGCG GATGACTTAA CCCCTTGACC AACGGACCTG AGTTGTTATT TTCAACTCTT 120 

ACTATTATAC AGTCTTTTCA AACTTTGTCA ACTACTTTTT CTAATTTTTG TTTATTTTTT 180 
CAACTTATAG TAAAAAAAGC CAGAATTATA CTGACTCTTC TATCGCTCAT TAAACTTAGA 240

AGCACGTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG ACTTTAACTG TTTTTCTTAT 300

ATTATAGTCC ATCATCAATT CTGCATCTTT ATTTTCAGCA TTAACCTCTA AAAGGAATTC 360

TCTACAAGCA CCGCAAGGCA TGGCTGAACT TCCACCATAA GGTGGTTTGT CTCGAAAGGC 420

TAATACTTTC TTAACCTTAG TTTGTCCTGA AAATTGGTAC ATATTGAAGA GGGCCGCCCG 480

TTCTGCGCAG AGATGGAAAA CACCACAGGT TCCCTCCATA CAGAATCCTG TAAATATTTG 540

TCCATCTCCT GCTTCTACTC CAGCTACAAC ATGATTGGCA TAAACAAAGT CTGATACTTC 600

ATGTGGATTG TATAGTTTCT GTGCTTCTTC GTACATCTTT TCCCAGATGT CCATTATTGT 660

ATCCTCTTTA TTTAGAGATT TCTTTTAGCA TGTTTTCCAT ATGCTGAATT GATTTTTCAC 720

GTCCAAGCAA GAAAATTGTA TCTGGTAATT CTGGCCCATG CATTTCGCCT GAAACTGCGA 780

TACGAATAGG CATGAAAAGA TTTTTCCCTT TAATACCTGT TTCTTTTTGG ACTGCTTTAA 840

TTTGTGGGAA GATATTTTCT GTCACAAATT CATCATCTGT CATCGCTTCA AGTTTTGCTT 900

TGAATGCTTC AAGAACTGTT GGAACTGTTT CACCCGTCAT GACTTCGCGC TCTGCTTCTG 960

TCAATTCTGC GAAATCTGAG AAGAAAAGAT CTGTCAATGG GATAATCTCA TCTACTGATT 1020

TCATTTGTGG TTTATAGAGC TCAACTAATT TTTCAGCCTT GTCAGTCAAA CGGCCTGCTT 1080

CCTCTAAGAA TGGTTTTGCC ATTTCAAAGA TGGTTTCAAG GTCTGCATTC TTGATATAAT 1140

CATTGCTCAT CCAGTCTAGT TTTTTCTGAT CAAAGGCTGC TGGTGACTTG CTGAGGCGGT 1200

TTTCATCAAA AAGTTTAATG AATTCTTCAC GAGAGAAAAT CTCATCCCCA CCACCTGGGT 1260

TCCAACCAAG AAGAGCAATA AAGTTAAAGA CTGCTTCTGG AAGGTAACCT TTCTTTCGGT 1320

AATCTTCGAT AAATTGAAGT GTATTAGTAT CACGTTTAGA TAACTTCTTA CCAGTTTCAG 1380

AGTTGATAAT CAAGTGTCAT GTGACCGAAC TCTGGAGCTT CCTCAACCTA AGAGCGGGTA 1440

1441

(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4398 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
CGGCTTATGT AGTGGCAATC TTTCTACGTA AGCGAAACGA GGGGAGATTA GAGGCGCTAG 60

AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 120
ATATGCACTT GATTGGACAA ACTCAAGTGG



ATTTATCTCT CAACTCTTTT GCCGATATTG 
ACCATTCATT TCGTTTTCTC AAGGCCAGTC 
CTTTGATTGA AGAAGATATT GCGGCAATTC 
AATCTAAAAA TAGTGGTCGT GTTCTTCATG 
GAGTTGCTGA AAATTCAGAA CAGTATGGTC 
AAAATATCCA ATCTGAATTT TCACAATTTG 
AAGCCGCAGT GATTTTGGAT AATACAGAAA 
ATCGTGTTCC AGCCTTGGTT ACGACGCTTT 
TGGAAGCCGG TTATCGTAAA CTAATTGATG 
AAGCGCGTTT CCACTTGCTT TATGAAGCAT 
TGGAATTGGA TAATGCCGAA TATGAGAATG 
ATGATATTTT TACTCGAGAA ATTGCTGCTC 
TTCCAACTTA TCTTCAACAT ATGAAAGAGA 
GTTTGAACAA GACCTATTTA CTTCCTGAGA 



CAGAATTAGA GACTTTTGAG 
CCCAAGCTTA TTCAGTTCTT 
TTGAAGATGA GCAAATTTCA 
ATCCACCTCA AAAGGCCAAT 
AAAAACGCAA TCTGCCAGGT 
ATAATACCGA GGATTTAATG 
CCCGAGTTCT TGAAATTGCA 
TTGTACAATA TGCAACTTTG 



GCACCTATTG 

GAAGAAAATC 

GTTAGTGAGC 

GTTTATGTCA 

ATTCCACAAA 

GTTGAGTTAG 

ACGAATGATA 

ACAGAGCAAC 



CTTTCCGTGA 
AAAATAATCT 
ATCAAATTGA 
GCAATGCTTT 
CTTTGGATTT 
AAGCCTTGGA 
TAACCTTGAA 
ATCACATTTT 
CTACAGAATT 
CTAATTATCA 
TCAAGAAAAA 
GACAGGCACA 
AGAAAGTAGT 
ATAATACTTT 
CAGCTGCAAG 
TTGAGGTAAC 
TTGAGGATTT 
GCCTGACACA 
ATCGTCTCCA 



ATGGAATCAA 
CTTTGAAGCA 
CCAAATTGAG 
GGCAGACTTA 
ATTTGAGGAA 
TGAAATTGAA 
TTCATCGGGT 
GGCCTTAAGT 
GCCAGATCAA 
TTTTGTTGAA 
CCAAGAGAAT 
AGAGGAAATC 
GGAAAATCTA 
ATTGGGAGAA 
CCATGTTCGT 
TTCAAATCAA 
ACAAACTCAA 
AATTGAGAAA 
TACTATCAAG 



AAATGGGTCG 
GAAGGCTATA 
AGTCAAATTA 
GAGAAGCAAG 
CTTCAGCATA 
AAACAATTAG 
GACCCTGTGG 
CATATTGTGG 
TTACAGGATT 
ACGGATATTG 
ATTCGTCAGT 
AATGCCTTGT 
CTTGCAACTC 
GATATTGCAC 
CGTATTCAGA 
GAAGAACCAA 
CTAAAAGATA 
GATGATATTA 
CGATACATGG 



TTGATGAACG CATTCAAGAA GCATTTAACG 
ATTATCACGC TTCATTTGAC AAGATTTCTC 
CCAATCGCTT TGTTACCTCA TATGAGAAAA 
AAAAGATTTT ATTGTGTGAG GAGCAGAATC 
ACTTCATTTT CTCCTGAGAT TGAGTTTTTG 
GTGTTTTATA CTCTTCGAAA ATCTTTTCAA 



CTTTCTTGAA 
AACAAAAAAT 
TGGAAGCTTT 
TCTTGCAATA 
AttGCTTTAGA 
AAGCATTGGA 
CACGTGAAAC 



GTTATTCTTT 
GATTAACATT 
AGAAACGGAA 
TTCTAACCGC 
TATTTTTGAA 
AGTGGCAGAG 
GATTCGTTTT 



ACGGCAAGCA 
GAATCTGTTA 
ACTTATAATA 
TATCGCTCAT 
AAAGAATTTG 
CCTGGTGTAA 
TAATAAAAGA 



AAATCTTTTT CTATAGTTGT GGGGAGATTT 
CCCAGCCGAT TTATCCACTA CCTCAAAACA 
ATCACGTCAG CGTCGCCTTA CCGTACTCAA 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
GTACAGCCTG AGGCTAGCTT CTTAGTTTGC
TTCGCCAGTC TTATCTGCAG CTTCAAATCT 
AACGAACTCT ATTGAAAAAT CTCCAGACTA 
ATTTCTTATT TGCACTTTTC TTGTACAACT 
TCTTTGTTTA CGAGAGTTTC CTCGTTTGGA 
TCGCTATTTA TACTAGACTA AAATCAAAAA 
AAAGAAGAAA TCCAAACCAT CAAAACACTT 
AAACGCCTTC AAATCGTTCT ATAGTAAAAT 




TTTTTGATTT TCATTTAGTA TTAAAGTGAT 



GTACTTTGAG 
GAGAACTCAC 
TTAGTCCACG 
AGACATTCTA 
GCATTATATA 
TTAAAAGACT 
GAAATAAGAA 



TAACTTGGTA ACCGTCCAAT 
GGATAGTTCC TAATCTGGAG 
GTAAATAGAC CTCTAAAACC 
GAAGATAGGA TAGATATTTC 
ATAGTGATAT GAAATCAACT 
CTCGTACAGC TAAATATCAT 
CAGTACAAAT CGATCAGGAC 



AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGAGGTG TACTATTCTA GTTTCAATCT 

AAAGAGATTA TAGAACTTTT ATAGTAGATT 
AGTCAAATTA ATTTATAGAA ATATTTTAGC 
ACTATAGACT GTAATCAAAC AACGATTTGG 
CTCGACTCTC TCCTTCAAGA AACACGTGGT 
GGTCGTAACC ATGCATATAT GACAGTTGAG GAAAAGAAAG TCTTTCTTGC CCGCCATTTG 
AAGGCTGCAG AGGCAGGAGA ATTTGTTACA 



ATTATATTTC GTCTGATGGG CAAATCTTAT 
GAAATAAGAT GTGAACAACT CTATCAGGAA 
AGCCAAGGTG TACTGTTATA GATTCAATAC 
CGAAATGTAA AAAAATATGA GGAGTTCGGA 



GAGTTAGGTC GTTCCTACAC ACGTGATGCC 
CGAAATATTA TGCCACGTCC AGAACATCCT 
TCTAAAAATA AAATCTCAAT TCAAGAAGAA 
TCGTAAGCTT CGCTTGATGT ACCAAGATGA 
ATCTTGTTGG GCTCCAATAG GAGTAGGTCC 
TCGCTATTGT TATGGAGCTG TTGATGCCCA 
TAGATGTAAT ACTGAGTGGA TGAACGCCTT 



ATTGATGCCT TATTTCAGGC TTATAAAAAG 
TTCTATCAAC TGTTGAAGTG CCATGGTTGG 
AAGAAAGCAG ACGCTCAAAC CATTGTCCCG 
AAGAAAGCGC TTTAAAACCA GTAGACGTTT 
GGCTGGTTTC GGTAGAATCA GTAAACTGGG 
ACATATCCAT AGTCACTATA TACGAGAATT 
TACAGGCGAA TCATTTTTCT TAATAGCTGG 
TTTAGAAGAG CTTTCACAAG CTTATCCAGA 



TGATTATCTT TTACTCCTTA TGGACAATGC TATATGGCAT AAATCAAGTA CCTTAAAGAT 
TCCGACTAAT ATTGGTTTTA CCTTTATTCC TCCATACACA CCAGAGATGA ACCCCATTGA 
ACAAGTGTGG AAAGAGATTC GTAAACGTGG ATTTAAGAAT AAAGCCTTTC AAACTTTGGA 
AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG TGATAAAGTC 
CATCGTTAAT CGGAGATGGA CTAGAATGCT TTTTGAAAAC AGATGAGTAT AAAAAGAAAG 
TCCTCATTTC AATAGAAATC ACGACTTTCT GATGGATTTA TAGTAAAATG AAATAAGAAC 
AGGACAAATC GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 
ACTATTCTAG TTTCAATCTA CTATATTTTT GGAGTGATAG AAAAGCCCTT CATAAGCTAG 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 
TCTACTTGTT CAGGTGCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3720

CCGTTCTCAA AGCGTTTATA TACTAGCCAA AATCCTTGAC CATCCCACTA AAGGGCTTTA 3780 

AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTGAC CCGAGAAAGA ATCCAATTCA 3840 

AAGTGGGTTT TAACTACATA GCCTAATGAC TCTATTCCCT GCCTCATATC TGTCTTGCCA 3900 

CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATACCTTTCC 3960 

TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG AATTAGGATA GATACCTTGT 4020 

TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTCCT TCTTCATTGA 4080 

TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 

CAAAAGTTAG ACAGAAAAAA TCTAACTTTT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4200 

ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 4260 

ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATCAT TAAATTGATT GATCGTTACG 4320 

GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 4380 

TGATTAATAA AGTCTTAC 4398 
(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294:

AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC ACCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 120 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAGGCTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 240 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 300 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAG 360 

TATCTTTAAA ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 420 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGGT TCTCCTTGGG ACTGTGCTAC 480 

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 540 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600 
CAAAATAGAA TGAAGCAGTT TTGGCTACAT ACGCTTCTAA GAACCTATAG TTCACTGATG 660

ATCATTATCA TTGCGAGTTT TGCAATCTTA CTCTCTTACG CTGTCTGGGA TTCACGTG 718

(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 718 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

TCGGTACCAA AATTCTGGAT TTATACTAGC AAAGATCCAA GAGCAAATTA TTTAACAGAT 60 

TTAGGTCTAG TTTTCCCTGA ATCATTAAAA GAATTTGACA GTGAAGATAG TTTTGCAAAG 120 

GAAATTTCTG CAGAAGAAGC AAATAAGATA AATGATGCTG ATGTAATCAT AACTTATGGT 180 

GATGATAAAA CTCTTGAAGC TTTACAAAAA GATCCTCTTT TAGGTAAAAT AAATGCAATT 240 

AAAAATGGTG CCGTTGCTGT AATTCCAGAT AATACACCGT TAGCAGCCTC ATGCACTCCA 300 

ACACCACTTT CAATAAACTA TACTATTGAA GAATACCTAA ATCTTTTAGG AAATGCATGC 360 

AAAAATGCGA AATAAAAAAC AAATAAACCT AGGCATAATT TTTATAATCT GCCTAGGTCT 420 

TCTTATTACA ATATTTTTGT CATTAAAGCT TGGAACAAAA GAAATTAATA TCAGAGATTT 480 

TTTAGCAGCT TTTGGAATGG GTAATACAAA TGATGATTTT ATTAAATCAA TTATATATAA 540 

rAGAATACCT AGAACTATTT TTGCAATTTT AGCAGGTTCT AGTCTTGCCA TAAGCGGTCT 600 

ATTGATGCAA TCAGTTACTA GAAACCCAAT AGCTGATCCA GGTATACTCG GTATAAACAC 660 

AGGAGCAAGT CTTAGTGTAG TAATTGGTCC TTCtTTTTAG GGAATTCATC AAGCATAA 718 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

GAACTAATCA TTTTTACAGG ATGAGATTTA CAGCAGAGAG TTTGAAGGCT TTATCAAAGG 60 

TTTTTCTTGG CATAATGACT TTTCCTCGTT TCCACTTAAT TTTGTGTCTA CTTTATTATA 120 

CCAAGTCCAC sCTTAAGTTA GATAATAAAT CTAACTTAAG GAAGCTAGAA GGATGAGAAT 180 

CCAGGTGGTC AAGAGTCCCA AACTTAAGCT GATGGGGACA CCCAGAATAA TTTGCTTTTT 240
GAAGGCAAGG CCACGTTCCT CTATATTGGG AAGTGAGAGT
TGAAAAGGGT GAGATATTAG TAGATAGAGC GCCAATAACG 
AATATCAATC TGAGGATTTT GAGCACTGAT GATAGCAATG 
TACAACGGAT AGGGTGGAAC TAAAGAGTGA CATCACTCCG 
AGGTAACCAG AAATGAGGAA TGGTTGTTGT CATGAGGTGC 
TGACTTGACC GCTACAGACA TTAGTAAGCT CATGCCGCAG 
GGGAACCTTA GCTAAAATGG CTTCTTGCTT CCCTAATTTG 
CATGAGTATT GAGACAAAGC CAATATCAAA TGTTTTTTGA 
GTTTGGGAAA ATGAGATGCA ACAAGGGAAA AAGCCAAACC 
GAGCAAGGTG GTTTGTCTTT GAACCTTGCT GAGGAGTGGT 
TGAGTTTGTT CTTCCCTTAC TATAGTGACT GTAACAGGAT 
TGGGTAGATA ATGCTGACGA TAAAGATATG ATTGCCAAGT 
TCCCATTTGC TTAAACAGGC CTTGAAAGAC AATGCCTGAG 
CCCTCCTGAA GCTCCCCAAT TGACGGCTTG AGCTCCAATC 
TTGACAGAGG GTAATCGCTA GAGGACAGCA AACGGCCATA 
TAAAGCAGAC AAAAGGGTTG CCATCAGGTA TAAAATCATG 
GCCTGTGCGG TAGAGAATCT GTTGAGCCAA AACATCAAGA 
GTTATAAAAG AGAGAGACGC TAAAAATGGT AAAAAAGAGT 
AAGTTCTTTG GGGCTTAATC CCATGAGAGT GGTTGCGATG 
CAGCAGCCCA ATATTGATTT TGGTGCGGTA ACCAATTCCA 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



TGAATGAGAG 
GTGGCTGTTG 
ATGGGAAAGA 
GCTATCACAC 
CCTATCAGTG 
AGCATGATAA 
AGCCTTAAGG 
TAAGTAGCTA 
AAAACCATGC 
GGTTGGTCAA 
AATAAAAGCA 
GAAAAAGCTT 
CTACTGGTTA 
AAAGGGTGTT 
GTAGTGAAAA 
TAGAGGGCGT 
GTACCGTTAG 
GAGGTTGGCC 
AGGTAAGAAA 
ATGGCTAGAG 



AACCAGCTGA 
TGAGTAAGTG 
CGGCTGGAGC 
AAAAGAACAG 
TGACTAAACC 
TTGTAGCCCA 
CGAGGCAGAC 
TCCAGGCGAT 
TGCTGATCAT 
TAGTCAAGGA 
AGACGATGAG 
GCTCTTCCCA 
TCAAATTAGC 
TGTCCGCTTT 
ATCCAGCACC 
TAGGGTGGGT 
TTGTTGCAAC 
AAAAATGAAG 
AAGCAATAGC 
CAATGG 



300 
360 
420 
480 
540
600 
660 
720 
780 
840 
900 
960 
1020 

1080 

1140 

1200 

1260 

1320 

1380 

1436 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 60 

GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 120 

CTAAAAATCG AAGGACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 180 
AAGGCGGCTG TGGATGCCTA TCTTGAAAGT CCTGAAAAGT
TTGGTGGACG AGATCTGGAA GCTTGCCCAA CGTGAACTGG 
ACACCATCTG AAAATGAGCA C TTCTTTCGT GCTCGCCGTA 
GTCGCTGAAG TGGTTTCTTA TGATGATGCG GCACAAACAG 
GTCATTAACG AAGCGGACCA AGTTGAGTTT TATGGTCCAG 
TATATTGAAG ATTTGCATGA TGCCAAAGGC AATAAAATCG 
GAACTATTGA CTATTAAGGT GCCTCAACCC GTTCAATCAG 
AAAGAAGGAC TCATCAATCT TTATAAGGAA GATGGAACCA 
GAAAGGAAAA GGAAATGATA GAGGCACAGG GTTTCTTAGT 
TTCATTACCA TAGCAAGCTG GATATTATTG CTTTACAATG 
ATGCTTGTTA TCGGTGTCAT GATTCATTAG AACATCACCC 
CTTTGATACA GGATAAGCCT ATTTTATGTG GTGTTTGTCT 
AATATAAAGA AAGCTTAAGT TGCCCCTTTT GTTTTTCTCG 
ATCATAAGGA ACGCTATTTT AAATAGCAAA TCATCTAGTT 
ATTTCAAGAG AAAATGAAGT AAATCTTCCC ACAATAAAAC 
AATACCTGAT ACTATGCGTT TTTAAGATTT TAAAGACTTT 
GACTACTTGT TAAAACTGGG TTAATTTTCG ACTGTTTAAT 
AAGGTTAGAA TTGTCAAAAC AATCCGTCTA CAGTATCCGT 
TTCTCAGTCA TGCCGTTGGA AGTACGACCT TTACGATATG 
GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 
TGACCATATC TCAGATATGA TTGAAAATGG TGTGGACAGT 
GCAGTCTATT CACTATGTAT CAACAGTAAC CAACTGCTAC 
TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC 
GCTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT 
CTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT 
TGATGATGCG GCGGTA 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1022 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



TTGAAGCTAT 
CTACAGGATT 
AAATTCCTGA 
CAACAATTCG 
GTTTCCGTCA 
ACCGCGCTCC 
GAGATATGGT 
GCGTCACAGT 
GGATAAGCAA 
CTATGATTGT 
TTTTGAGCCG 
AAAACTACTA 
CTTTAATCCA 
TTGAAGTAGG 
GCATAATATC 
TTTCCTTTAT 
AGTTATTATG 
GATGCCAACC 
CCATTTGGGA 
GCCGTTGATA 
CTAAAAATCG 
AAGGCGGCTG 
TTGGTGGACG 
ACACCATCTG 
GTCGCTGAAG 



CAAACAAGAC 
TTACTATGGT 
GTACAAGTTT 
TCAACGAAAT 
TTTTGAAACC 
AAATCCAATG 
TCGTGCATTA 
TCGAGCTTAA 
ACAAGATGCA 
AAAAAGTATT 
TATCCCTTAT 
ACATATAAGC 
GGTTGCCAAA 
AGAAAACTCA 
AAGATTGTTC 
CTGGTATTTT 
CAAAGTCTAA 
GTGGTGGATG 
AAGAACGTAA 
TGTCTATGAT 
AAGGACGTAT 
TGGATGCCTA 
AGATGTGGAA 
AAAATGAGCA 
TGGTTTCTTA 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1696 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:

CCGAGTTTAT TATGGTTTCT TCGGAATTTA TCTCAAAGAT TGAATTTGCT TGCAATAAGA 60

AAGAAAGTCT TTATAGTCAA AGCAAATTTA AGTATGCGAT TCGTTCGATG TTCGCAGGTG 120

CATTTTTAAC CTTCAGTACT GCTGCAGGTG CAGTTGGGGC TGACTTGATT AATAAAATTG 180

CACCAGGTAG TGGACGCTTC CTCTTTCCAT TCGTTTTTGC TTGGGGCTTG GCCTACATTG 240

TITTTTTGAA TGCCGAGTTG GTCACTTCAA ACATGATGTT CTTGACTGCT GGTAGTTTCT 300

TAAAAAAAAT CTCTTGGAGA AAAACAGCTG AGATTTTACT ATACTGTACC TTGTTCAACC 360

TTATCGGAGC CTTGATAGCA GGGTGGGGCT TTGCTCATTC GGCAGCCTAT GCGAATCTGA 420

CACACGATAG TTTCATCTCA GGTGTTGTTG AGATGAAGTT AGGCCGCTCA AATGAATTGG 480

TCTTGCTTGA GGCGATTTTG GCAAATATTT TTGTAAATAT TGCGATTCTG TCATTTATTT 540

TGGTCAAAGA TGGTGGTGCC AAACTTTGGC TTGTGTTGTC AGCTATTTAC ATGTTTGTAT 600

TCTTAACAAA CGAGCACATT GCGGCGAACT TTGCTTCTTT CGCGATTGTG AAATTCAGTG 660

TTGCTGCGGA TTCAATTGCC AACTTCGGTG TTGGAAATAT GCTTCGCCAC TGGGGTGTGA 720

CTTTCATCGG AAACTTTATC GGAGGAGGCC TCTTGATGGG TCTTCCATAT GCCTTCCTCA 780

ATAAAAACGA AGATACTTAT GTAGATTAAG AAAATGAGCA CGATTGAGTC GTGCTTTTTT 840

CATTTTCAAA ATAAGGTAAT AGCTATTTCT TATATCAAAA TATAGAAAAC TGATATTTGT 900

ArACTATAAC TCAAGGTGCT ACAATATCCT TAATAAAATA ATATGGAGGT CACCTTATGA 960

CTTGTGATTT TAAATnTGAA ACTCTACAAC TACATGCTGG TCAAGTTGTG GCTCCAGCTA 1020

CT 1022



(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:

CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60 

TTCATTATGA GTGGCAGGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 120 

CATTAATAAA GTTCCTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180 
AATAGATTCT CTCGCATTAC TTACATAACC AGATATAGGC ATATCTGATA TAGATACCCA 240

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GGAGTTTTTC CTATTCTGAA 300 

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 360 

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 420 

AATGTCCAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 480 

TATTTTTTCA AGTAAAACTT CGACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540 

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGCTG 600 

TTCTAGTCTA TTATAACTTT CAGCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660

TTC 663 



(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

CGTCGCTGAA CATGTCAACA GCAAATTAAA CTAAACAAAC TAAAATTATG TGATACTTCA 60

CATAATTTTC TTTAGAAAAT ATTATCAGAA GAAAGTTGAG AAAAATGGCA GAAAAAACAT 120

ATCCTATGAC CCTTGAGGAA AAGGAAAAAC TTGAAAAAGA ATTAGAAGAA TTCAAATTGG 180

TTCGTCGACC AGAAGTGGTA GAACGCATTA AGATTGCCCG TTCATACGGT GACCTTTCAG 240

AAAACAGTGA GTACGAAGCA GCTAAGGATG AACAAGCCTT TGTCGAAGGA CAAATCTCTA 300

GCTTAGAAAC AAAAATCCGC TATGCTGAAA TCGTCAATAG CGACGCAGTT GCCCAGGACG 360

AAGTAGCGAT TGGTAAAACA GTCACCATCC AAGAAATTGG TGAGGACGAA GAAGAAGTTT 420

ATATTATCGT AGGTTCAGCT GGTGCAGATG CCTTPGTAGG TAAGGTTTCA AATGAAAGCC 480

CAATTGGGCA GGCCTTGATT GGCAAGAAAA CAGGTGATAC AGCAACCATT GAAACGCCTG 540

TTGGTAGCTA TGATGTAAAA ATCTTGAAGG TTGAAAAAAC AGCCTAAAAA CAGAAAAAGG 600

AGTGGGGAGG CGATGTGCTT CACTCACTCC TTTTTCCATT TTGCTACTCT TCGAAAATCT 660

CTTCAAACCA CGTCAGCGTC GCCTTGCCGT ATGTATGGTT ACTGACTTTG TCAGTTTCAT 720

CTACAACCTC AAAACAGTGT TTTGAGCTAA CTTCGTCAGT TTCATCTACA ACCTCAAAAC 780

TATGTTTTGA GCTGACTTCG TCAGTTTCAT CTACAACCTC AAAACCATGT TTTGAGCCGA 840

CTTCGTCAGT TTCATCTACA ACCTCAAAAC TATGTTTTGA G 881
(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 949 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:



CCTTTTTTAA TACAAGTTAT TTTGATTTAA CCGGCTTGTC TTCAGCTGTC TGCAAAGCTG 60

TGGCAATCGT ATCTGCATAC AATTTTGCTC CTGCTTCGAT AGTGCTACTC TCACTCCCGA 120

AATGAACCTG GTCTGTTCCA GCCCAAATTT CTGGATGCTC TTTCGCAACT TGATTCCAAT 180

CTGCTATCGT AATGTAAGGT GTCTTCTCTG CCAATTCTCT CATATAGGCA GCAGCCTTCT 240

CAACGATGGC ATAGGTCTCT TTTGTCTTAT CTCCCTCATA AGGAGTCACC AAAATCATAT 300

GGTGTCCCTT AGGAAGATTT TTCACGATAC TGTCCCAGTC ATCCTTGTAA TTCTCAGGAT 360

TATTTACCCC AGTCGCAATG ACCACCGTCT TAGGTAAAAA TTTATTCTGG CTATTATTTA 420

GCATGATTTC ATTTGCGGTC TTGGTTGTTA CGCTGACCTG CGCGTTAATC TGTGCTCCAG 480

GAAGAGCTGT CTGTAGTGCT GTATTTGCCC TTAAAGCCAC TGAGTCACCA ATTAACATAG 540

TGCCATCAGC AATTCCCAAA CTGTTTGCAT CTGCCCGTTC TGCCATCACC TTGGTCTGGC 600

CAATATTTGT TGCAGCTTGC TTCAAGCCAT TGACAGTCAA GTCTGTCTCA AACGCTCCCA 660

CTTGTGGTGC CAACAAGGTC ACCGTGCAGA CAATGATGGT CAAGATTCCT GTACCTGCTG 720

CAAGAATTGC GTGAATATAA GGCAGGGGAC GAAsGGTTTG GACAATAGGT GTGTTCTTGC 780

CTGCAATCCA AGGTTCCAAT ACATAAAATG ACAGACTGGC AAAGCCATAA GAACAAATCA 840

GAGTCAGTAA TACAGCAAGA AGATTTGATG TCAACTGTGA GAAAATGATA TAGAAACGCC 900

AATGGAAAAG ATAAACCGCA TAGCTAGTAT CCGCTAAAAA GCTGATAAT 949



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 60 
CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACCTAC ACGGCTAAGG AAAAATAACA 120
AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TGTTAGGTAT TTAGCAAGGG GACAAACCCC 180
TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGGA AAATGACAAC TTAATTTATA 240
AAAAGGAAAA ATGGAGGATA TAAATGGAAA TTCTGTCTAA AGAAATACAG TTACAGGGCT 300
TACAACTTCT TAAACAGACT CTTGAAACTT TAGTTGAGCT AOAAAAACAA CGATCTAGTA 360
ACTTAGATTT AATTTCTCGT AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 420
TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAACA 480
AAGTATTCTA TCGTCCGTCA GATGTGTATT TATTTTTAGC AATAAAATAG GAGTTATGAA 540
ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATCGCTTTTA ACTGCCAGGT 600
ATTGACATTC TATTTGTCTC AG 622
(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:
CGCTAACTTG CAAACAAAAG AAGAACGCAA ACTCCACAAA TCCTTTACGC AGAAACTCAA 60
TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 120
ACGTAATTTC TAAATCAGGT ACAACAACTG AACCAGCGAT TGCTTTCCGT GTCTTTAAAG 180
AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 240
ACCGCCAAAA GGGTGCTGTT AAGGTTGAAG CAGACGCTAA CGGTTGGGGA ACATTTGTTG 300
TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 360
CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTCC GAATGCAGCT CGCAAAGACT 420
ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 480
TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 540
TCTCAGAATG GTCGAAACAA TTGGCTGGTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 600
ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 660
GAACTCGTAT CATGTTTGAA ACAGTTGTCC GTGTTGACAA ACCTCGTAAA AACGTGCTTA 720
TTCCTACTTT GGAACAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 780
TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCAA 840
ACATGTATGT GACTCTTCCA GAGCAAGACG CTTTCACTCT TGGTTACACT ATCTACTTCT 900

TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG 960 

GTGTTGAAGC TTATAAACGT AACATGTTTG CCCTTCTPGG AAAACCAGGA TTTGAAGAAT 1020 

TGAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CCACTCTTTT 1080

TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGCA 1140 

AAAAGGCAGA CGCCTAGATA ATAGGAGAAA CTATGTCAAA AGATATCCGC GTACGTTACG 1200 

CACCAAGTCC AACAGGACTA CTACACATCG GAAATGCTCG TACAGCATTG TTTAATTACT 1260 

TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CGAAGATACT GACCGTAAAC 1320 

GCCATGTCGA GGATGGTGAA CCTTCACAAC TTGAAAACCT TCGCTGGTTA GGCATGGATT 1380 

GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TGAGCGTTTG GACTTGTATC 1440 

AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 1500 

AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTGG CGAAACACCA CGCTACATCA 1560 

ATGAATACCT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATCGCAGAA CGTGAAGCAG 1620 

CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGGTATCTAC AAGTGGCATC 1680 

ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGGTGAC TGGGTTATCC 1740 

AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 1800 

AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGC 1860 

TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG ATTATCCACT 1920 

CTGAAACTG 1929 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

AAATTTAAGA AAAAGGAGAC ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60 

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAACCT GAGAAAGCAC TTGCTGGTAA 120 

AGCGGAAGTT AGCTACCTTG ATTATTCAGC CC T IVU TCTC TTCAGCCAAG ATTTGGAAGT 180 

TCCAACACAT CCAGCTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGCTATCTG 240 
GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGGTACA CTGAAAAACT TGCTTGACTG 300

GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCGTT TCTGCCCTTC AAGACAAGTT 360 

TGTCACAGTA TCATCTGTAG CCAATGCAGG GCACGATCAA CTTTTCGCTA TCTACAAAGA 420 

CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTGCAC GTGTTAATGA 480 

CTCTGCCTGG GCAsACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 540 

ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 600 

AAACTAJcGAA GCTArCCGCA AGCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 660 

CGAGTGTnnA ACATATATAC GGTAAGGCGA CACTGACGTG GCTTGAAn 708 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 



CTTCTTTTCT TGGAAATAGG TGTATAATAC GTTTATTAAA TTTTTGAGGA GTTGTCTATG 60

AAGAAAAGTT TTATCCATCA acaagaagaA ATTTCCTTTG TCAAAAACAC TTTTACCCAG 120

TATTTGAAAG ATAAGCTAGA AGTTGTCGAA GTTCAAGGTC CTATCTTGAG TAAGGTCGGT 180

GACGGAATGC AGGACAACCT GTCTGGTGTG GAAAATCCAG TATCGGTCAA GGTTCTCCAA 240

ATCCCTGATG CTACTTATGA AGTGGTGCAC TCACTTGCTA AATGGAAACG CCACACCTTG 300

GCTCGTTTTG GCTTTGGTGA AGGAGAGGGT CTCTTTGTCC ACATGAAAGC CCTTCGTCCA 360

GATGAGGATT CCTTGGATGC AACCCACTCT GTTTATGTTG ACCAGTGGGA CTGGGAGAAG 420

GTTATCCCAA ATGGTAAGCG TAACATCGTT TATCTAAAAG AAACAGTTGA GAAGATTTAT 480

AAGGCTATTC GCCTGACTGA GCTAGCTGTT GAAGCCCGCT ATGACATCGA GTCTATCTTG 540

CCAAAACAAA TTACCTTTAT CCATACAGAA GAATTGGTAG AACGCTACCC AGACTTGACA 600

CCGAAAGAAC GTGAAAATGC GATTTGTAAA GAATTTGGAG COGTCTTTTT GATTGGTATC 660

GGTGGCGAGT TGCCAGATGG TAAACCGCAC GATGGACGTG CACCAGACTA TGATGACTGG 720

ACAAGCGAGT CTGAGAATGG CTACAAGGGT CTAAATGGTG ATATTCTTGT CTGGAATGAG 780

T 781
(2) INFORMATION FOR SEQ ID NO: 306: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 846 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:
CCCGCATCTT GTAGGGTTTT AACGGGCACG ATTTTCATAT CCCTCTTGAT TGTTTTAGCC 60
GCTTCTAGGG CTGTTTGGTA GTTCTTTTTC GCGTCCCGAT GCGCCTTTTG TTCTTCTTCG 120
CTAACAGGGT TATCAGGAGC AAAGAAAATA GCAGCACCTC CCCTAGCCGA AGCTACAACC 180
TTCTTATCAA TACCTCCAAT GTCTCCCACA TTACCATCGC GGTCAATGGT ACCTGTACCG 240
GCAACAATAC GACCATTACG AAGATCTGGG TGAGCTATTT GAGTATAGAT AGCTAGACTA 300
AACATGAGAC CAGCACTTGG ACCGCCAATA CCACCTGTTG AAAAGCTAAT TGGGACATTG 360
CTGATTACCT CTGTACGGTC AATCAAGCCG ATTCCAATTC CATTTTTGCC ATTTTCCAAG 420
GTGATGATTT TTCCTTCTGC AGACTTGGTT TCCCCATCCT CTTCATAGGT GACCTTGACG 480
GAATCCCCTA ATTTTTGAGA ACTGACGTAA TCAATCAAGT CTTTGGAACT ATCAAAGGTC 540
TGATCATTGA CTCCTCTGAC TGTATCAGAG ATATTGAGAA TCCCTTTAAA GGTTGAATTA 600
TCCGTCACAT TCAAAACATA AACTCCAAAG TACTTGAGTT CGATATCCTT ACCAGCTGTT 660
TTTAGTCCTT GATACTTGGC CATATTTTGC GATGTTTGCA TGTAGAATTG ATTGATTCGC 720
ATAAATTCAA CATCGGAAGA ACCACCTGTA GTCTCCTGAG CACTACGAAT ATCTGTAAAA 780
GGTGTCAACC AAGCATAAAT CATATGAGCT AAAGTGGCAT GTTGAACACC AACCGTAACG 840
AATTGT 846

(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:
GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 60
CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAACG 120
TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 180
GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA ATCACCTTGC GCTTGACACC 240

AGTTACTTTA CAAATATCCC GATACATCAA GACGTTAGGA ATGATGAGAG CAATGGTTGT 300

TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCCATGGTA GGTAGTTGCA AGACTACCTT 360 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCGGAACA TGGCCTGAAG 420 

CATTGGAGAC AAGACCATGT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GCCAAACTAT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCGTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTAGGATT 720 

GGCTGAGAAA TAAGAAAACA TGACAACCAA GTCCTCATTG CTGTAGTTGG TAAACCAGCT 780 

CATACTATTG ATAAAGGTCA GCTGAGTCCA AATCTGGAAG AGCTGGATG 829 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

CGAACATCTT GCTGGCTGAT TOGTCTGCCG CCATCGCAGC CCCGAACACA TTGCGACCCA 60 

TGGCAAGCGG GCTCAATCOG CACATGGGAT CCGTGCCAAA GCCCCGCGTG TGCATCATTT 120 

GCTCATCTAG TAACGTATGA GGTTTGCCTT CGCTGTCGAT AAACCGATAT TCAATCGCAC 180 

CACTGCTCGT TCTCCGCGGA GGGGAAACCG ACTGCGGTAG GATGAACTCC AGAGAAGAGA 240 

GATCACGACC TACCAGGTGC GGCTCGTTGA AGCTGTTGCC GCTTAGCAGC AGGCTCGCCA 300 

CCACGCATTC CCAGAACTCA ACGGGGGTTT GATCGGCGTT CGGTTGCTGA CTAATAACTC 360 

GGTGCACGCG ATGCGAAGTG GCCACTTCTG GCACACCGTT CTTGTCTTCG TAGAGAGCAA 420 

TTGGGAGGGT GCCCAGCGTT TCGGCGATGA GGCGCACCCA GGCC 464 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
CCGTCTATAA TGCTAATAGA TTTTATTTCG AGGTTTTTAT GTCATTTCTA TCAAAAAATG 60

GAGCAGGTAT CTTCGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 120 

TCCCTGTGGT TGGCGCGCCC GTTTTTGCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC 180 

TCTCGTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 240 

ATGCCGTTGT CTTGCTTGGT TTTGGTCTCA ATATCTCGCA GGTCTTCGCA GTTGGCCAAT 300 

CTTCACTCCC TGTCATCCTG TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 360 

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTGGT TGGAGTAGGT TCTTCTATCT 420 

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTAGC 480 

CCAAGCCATT TCCGTTATCT TTTTCTTCAA TGTCTTGGCT GCGCTCATCT TTCCAACCCT 540 

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGCAGGGA CTGCGGTCAA 600 

CGACACTTCC TCTGTAACGG CTGCCGCCAG CGCTTGGGAC AGTCTTTACC AAAGCAATAC 660 

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTG GCCATTATCC CTATCACGCT 720 

CTTTCTATCC TACTGGCAAA GTCGCCAACA AGAAAACAAG CAAAGCCTCC AACTGAAAAA 780 

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 840 

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900 

TGTCATGGAC ATGAGTGCTA TCGGTCTCAA AACCAATCTG GTCGCTATGG TCAAATCCAG 960 

TGGAAAATCC ATTCATCATG GA 982 
(2) INFORMATION FOR SEQ ID NO: 310:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

CTAGCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 60 

CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA 120 

GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 180 

TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGGCTTCCC AGCTCAGATT 240 

TGCCAGCATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 300 

GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 360 
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ACCTGACCAG CTGGGAATTT 
TATTTGGCTT TGGCAAGGGG 
CGGAAGCTGC CTTAAATCTT 
CTTCGATTGG AGGAATTCTT 



TTCAGGCTAA 

CTAAGAGTGA 

ACCGCTACTT 

CCCAATTGAC 

TCAAATGGTT 

CTGGAAATGC 

TTAACCGTGA 

CATCTTTTGA 

AATTTGGACT 

AAACCAAGGA 



AAATTATGCC 
CACCAGTAAG 
GGGTTCCCTA 
AATTGATGGG 
TAACAATCAA 
GGATTTGGTG 
TGTCAAACGT 
GGTGGACGAT 
TGCTGTTCCT 
ATACAGCTTA 



AGGAAACCAT TGAGCAAATC 
TGATTTCCAA GAAAAACGTG 
ATGACATCTA TCTCTACACA 
TCATCCTTGA AAATATGCGA 
AAGAATCAGC CCGTGAATCA 
TCCCAATCCT CATCAACCTC 
CTGGCTTGGT CAAAGAGTAC 
CTACTACAGT GGAAGAGATG 



CTCTTCTTCC 
ATAAACCTTG 
GAGGGTTTCA 
CTCTTGGCAG 
AATGTAGTCA 
GTTCCTATCC 
ACCGATAAGG 
AAACCTTATC 
GCCAAGGGAA 
GACTTGAAGA 
CACCTGCGCT 
GAGGGCAATC 
CGTCCTGCTT 
TCAGATGTTC 
AACTACAACG 
ACCCAGACTA 
GGTGTGACGT 
ACAGGAGAAA 
GCAGAAGGTG 
AATGACAAGC 
GCCCTGGTAG 
CTCAGCAAGT 



TCTGTGGGCA 



TCAAAACGGT 
AAATCAATCG 
CTTTGGTTTc 
CGGTTACGGA 
TAGATAGAAG 
TGTCGCAATA 
GGGTCACACC 
TCGGTGAGTA 
CACCAATCAA 
TGAAGTACCC 
CTTTCTATGT 
CACTCATTAT 
CAGAATGGGT 
GCAAGTACAA 
CCAATGGCTA 
CGGCTAATGC 
TCACTAAGTA 
CTGTTCAGGA 



TTTGTTAGTT 
TCATGTGCGC 
GTTAGGGAAA 
CTTGGTAACT 
AAAAGACTTT 
TACTGCTGAA 
CGTAGCGGCA 
ACTAGAATAT 
TATTAAGGTG 
GTATTCAGAC 
GACCAAAATC 
AGCAACGGTT 
CTTGGATGCT 
GGACAGCATC 
GGACGGTTTC 
TAATTACTTG 
GGATGAGAGT 
TAGCTTGGCT 
GAAATCCTAC 



GTGGCAATAT 
CACGGTAAGG 
ATTCTGTTAG 
TCCAGCATGT 
ACTGAATTTC 
AAAATTCGAG 
GATACCTATA 
CCAGACCCTA 
GACATGGTAA 
TCGGAGTATT 
TTTAAAACTC 
TACCAAAAGC 
ACAAATGGAG 
TATCCAGCAG 
TTGAATGCCA 
TCTATCGGTA 
AATCTTGGTT 
TCTGCGACAG 
AAAGCAACCT 



CTCTCTACAT CATGGGCTTG AAGGACAATG 
ACGCAGTCGA GTACCAAAAT GTTATCGTTG 
ATGCCAATAA AAACGACCTT GAAATTGACA 



ATGCAACGAC AGAAAGCATC AATGGAGTAG TAGCAGACCT CAAATCAGCT GTTATCAAtJG 
GAGACACTGT CTACTTCTTT AAAGTTGATG GCAACATCTA CAAGGTCAAG GCTTCACTAT 
CCGATGACCT TCCTTACCTT GAAAATGGTA AAACCTTCGA AGGTCAAGTA GGAAAAGACA 
ATTATCTCAA GACCTTTAAG CTAOGGTAAA AATAGGTTTT TTTCAGAAAG TATATGTTAT 
AATAAGGTAA ATTAAGCCG 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1939 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CCTGCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 60 

CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTGCGGATA TAAATGTAAC 120 

ATTATTTTTC TAATCTGTCA ATAAAATTTC TGACAATTTA ATAAATACAA CAAGGAGAGA 180 

GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTTA ATCTGATAAA 240 

ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 300 

TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGGATTTTT 360 

TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 420

CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTGAAATA GCTAGTTCAT TCCTTTAACA 480 

TGAGCCTATG CAAGGTCCAA GACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 540 

TCGTACTCAT CTCCAAATAT TTTTAAGCTA TCATTTTGTC CCAATGCGGT CTTGTTGATA 600 

TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 660 

CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGTAGCGATT GGCCATCCAG 720 

TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCAGG 780 

GTATCTTCAG TCACCGCCTT GTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 840 

AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA GAACTTGGTT CATACCAGCA 900 

ACGTTGG 907

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

CCACATAAAG GTAAATATCT TTTGTACTAT CTTGGGCATC CAAGAAAAGC AATTGGGCAA 60 

TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CATAATGATG CGGTCTTTGA 120 

GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180 

GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATGACTG AAGATAAGAC 240 
TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC TGCTTTCATT AGACAGAAAC 300

AAAAACCCAA CCTCCTTTCG TGACTGGAAA TACTTTTCCA AGTCATTCTT CTTTTCGATC 360 

TTATTTTGTA CCGAACAAGC GGTCTCCAGC ATCTCCAAGA CCTGGAACGA TATAACCGTC 420 

TTCGTTCAAA CGTTCATCCA AGGCTGCTGT AAAGATTTCT ACATCTGGAT GAGCTTCTTG 480 

AAGGGCTTTT ACACCCTCTG GAGCAGATAC AAGGCAGACA AATTTGATAT TTGATGCGCC 540 

ACGTTTTTTA AGAGAATCAA CAGCCAAGAT TGCTGAGCCA CCTGTTGCCA ACATTGGGTC 600 

TACTACAAAA ATTTGACGTT GGTCAATGTC CTCAGGCAAT TTCACCAAGT ATTCAACTGG 660 

TTGAACTGTT TCTTCATCAC GGTACATACC GATGTGGCCA ACTTTAGCAG CTGGAACCAA 720 

GTTCAAGAGA CCATCAACCA TCCCGATACC TGCACGCAAG ATTGGGACGA TGGCCAATTT 780 

CTTACCTGCC AATTGTTTTT GAACTGTTTT TGTAATTGGT GTTTCGATTT CCACATCTTC 840 

TAGTGGAAGA TCACGAAGTA CTTCATACCC CATCAACATT GCAATCTCAT CTACTACCTC 900 

ACGAAAAGCT TTTCTAGAAG TATCTGTACG ACGCAAGATT GACAATTTGT GTTGAATCAG 960 

TGGGTGATTA ATAACTTCAA TTTTTCCCAT TTTTGGAATT CCTTCTTTCA ATTTATTCTT 1020

CTTATTATAC CAAAAAACGG TTTAAAAATC TTTCTAAACC ATTTATTTTT GATAATTTTT 1080 

ACATTAGATC AGCCTCTTTA AGAGCTGTCT GTACTGTCTC AAGTGGTAAA TGGGTCAATT 1140 

CTGTCCCTTT TTCTTGATAA AGGTATTGGG CGTAGTCGTC CATTCGGTAC TGGTTGATAT 1200 

AAACCACGCG CTTGCAGCCG ACCTGAAGCA ATTGTTTTGT ACAGTTGAGA CAAGGAAAAT 1260 

GGGTTACATA GGCTGTAAAG CCTTTGGGAA CACCACGCTC AGCACCTTGA AGGATAGCAT 1320 

TGACCTCAGC GTGAAGGGTG CGAACGCAGT GGCCTTCAAT GACCAAACAT TCGTGATCAA 1380 

TACAATGCTC AGTCCCTGAC ACCGAACCAT TGTAACCAGT GGAAATAACC TTATTATCTT 1440 

TTACCAGAAT CGCGCCCACT TTAGCACGTT TACAAGTGGA ACGATTCGCA ATTAGTAGAG 1500 

CTTGGGCTGC AAAATACTCA TCCCAGGCCA GTCTTTTTTC AGTCATCTCT TTTCTCCTTT 1560 

TTCTCTATTT TTTAAAAAAT GGTAAACCTA AATCTGCAAT CTTTTCAGCT GGTACCTTCA 1620 

TGCCATCCTT GATCCATTTT AGAAGGACAG AGACGATGGC TGAGCTCCAG AAGGAATGAA 1680 

GATAAGAGCT GACACCTTTT GATTTCCCAT GGTATTTTTC TAGAAATTCC TGCATGGCTT 1740 

GGACAAAGAT TTTTTCCAGA TGGTAATCCA AGGCCAATTG AATTACTCTA GCTTCCTTTC 1800 

TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC TGTCTTTAAA TCGTAATGAT 1860 

GCAGCTGTTC CATAATATTG TGGACAGTTC GTTTAAAGAC GCTCTCTAAA ATTTCCTCTT 1920 

TGGAGTCATA ATTGCGATAA AAGGCCGCAC GCGAAACACC TGCACGTTTG ACCAATTCAG 1980 

AAATACTAAT CTTGGTCAGT TCCTTTTTTT CCAAGAGTTG CAAGAGGGCT GTTTCAATGG 2040 
CTTCTCTGGT TAATAAATTG GATTCTTGGT TTGATTTTCT GAGATTTTCA AGAGACTTTT 2100
CAGAGATTCT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATG 2160 
CTTTTGTTTC 2170
(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATCTGCACGA ATCAGCGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 60 

AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 120 

TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 180 

GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 240 

TTCGTTTCAC TTACGGTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 300 

AAATCAAAGG CGGAATCCTA GGTTTCAACT TTATGCTTCT TTTGGAACGT CGTTTGGATA 360 

ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCGTCA AGCTCGTCAA TTCGTAAACC 420 

ACGGTCACAT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 480 

GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT CAAGCAGTA 539 
(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 667 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 60 

CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 120 

AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 180 

CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTGAC 240 

TTGGATGACT TGGAACACCA GGACTTTCTC TTTGAATCTC ATCTGCTGGA GAAGCTGGTA 300 
CACCTTGACT TGGGTGAGTA GGCACGGTAG GAGCTTTTCT CATAATCTCC TCTACCGTTG 360

ACAAGGAATC AGCCATGAGT TCTTCAGTTG AAGGTTCATT TCCAGCACTG CGAACTACTG 420 

CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 480 

GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 540 

TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 600 

CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT GTTTCCCCTG CAAAGGTTGA 660 

CTGGGAA 667



(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:



GGGAAGCCAA GGTATTTTAT CGGATGAAGT TGTTACTAGT TCTTCACCGA TGGCTACAAA 60

AGAGTCTTCT AATGCAATTA CTAATGATTT AGATAATTCA CCAACTGTTA ATCAGAATCG 120

TTCTGCTGAA ATGATTGCCT CTAATTCAAC CACTAATGGT TTAGATAATT CGTTAACTGT 180

TAATAGTATC AGCTCTAATG GTACTATTCG TTCCAATTCA CAATTAGACA ACAGAACAGT 240

TGAATCTACA GTAACATCTA CTAATGAAAA TAAGAGTTAT AAGGAAGATG TTATAAGTGA 300

CAGAATTATC AAAAAAGAAT TTGAAGATAC TGCTTTAAGT GTAAAAGATT ATGGTGCGGT 360

AGGTGATGGG ATTCATGATG ATCGACAAGC AATTCAAGAT GCAATAGATG CTGCAGCTCA 420

AGGGCTAGGT GGAGGAAATG TATATTTTCC TGAAGGAACT TATTTAGTAA AAGAAATTGT 480

TTTTTTAAAA AGTCATACAC ACTTAGAATT GAATGAGAAA GCTACAATTC TAAATGGTAT 540

AAATATTAAG AATCACCCTT CCATTGTTTT TATGACAGGT TTATTTACGG ATGATGGTGC 600

GCAAGTAGAA TGGGGCCCAA CAGAAGATAT TAGTTATTCT GGTGGTACGA TTGATATCAA 660

CGGTGCTTTG AATGAAGAAG GAACTAAAGC AAAAAATCTA CCACTTATAA ATTCTTCACG 720

TGCATTTGCT ATTGGGAATT CAAATAACGT AACTATAAAA AATGTAACAT TCAAGGATAG 780

TTATCAAGGG CATGCTATTC AAATTGCAGG TTCGAAAAAT GTATTAGTTG ATAATTCTCG 840

TTTTCTTGGG CAAGCCTTAC CCAAAACGAT GAAGGATGGG CAAATCATAA GTAAGGAGAG 900

CATTCAGATT GAACCATTAA CTAGAAAAGG •rrrrccrrAT GCCTTGAATG ATGATGGGAA 960

AAAATCTGAA AATGTGACTA TTCAAAATTC CTATTTTGGC AAAAGTGATA AATCTGGGGA 1020
ATTAGTAACA GCAATTGCCA CACACTATCA AACATTGTCG ACACAGAACC CCTCTAATAT 1080

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACGTT TTACAGGATT 1140

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTTAAAGGAG AGAGTGTACA 1200 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 1260 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CCGATCCTAA 1320 

AACAAAAGCG ATACGAGTTG CAAAAGATAG TGCAGAaTwT TTAGGAAAAG TATCAGATAT 1380 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 1483 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:

CCTGAACGCT TTTTTATAAA TATCATAAAG CCAATCTGAT TTATCAAGTG TGTCTAAGCG 60 

ACGCGAATTA AAATTCATTG CATACTCCAT CGCTTCTAAA AAACTCATTT TTGAAAAGAC 120 

GTTAAAATCA TCTAAATTCT GACTCCAATA TAATAACAAA ACCAATCCCA TAATATCCTC 180 

TGGTTGATTA TTCAATAAAT TTAAGTTGGT TTCATAAAAC CCTGGAGTTC CAAATAGAGG 240 

CAACTTTTTT TCTTCAATTT GAGTTTCTTT CCTTAGGGCA TGCTCAAAGT CTATAATATA 300 

AATATTATTT CTATTATCAA TAAGTATATT ATTAAATGAT AAATCTCTAT AGGAAAGATT 360 

ATATTTGGAG TTTATTATCT CCATATAATC AATTAATGTT AAAAACCAAT CATACGAGCC 420 

ACTAACCATA TTATACTCGC TTAATTTATC TGCAATAATA AACTCAAATT CCACAAAATA 480 

CGAATTCTTT ATGTAAAAAT CGTTAAAAAC TTTTGGAGTA AATTCCTCCT TTTCCAATTC 540 

TACTAATATT TCTCTTTCAT TTATTAAACG ATTCACAGAA TCTCTATTTG TAAAATCAAC 600 

CAACGATAAA TCACTAGCTT CTTTTAATAA AGAATAAACT CGCTTTTGAG TATTAAATAC 660 

TTTATAAACT CCACCTTTGG CATTTTTAGA AATCACTTCC AAAATAATAT ATTGATCAGG 720 

AATAGTGTTA TATCTTGGAA TATAGTAATC CCTTATTGGA ACATTCACAT TTGAAGGGAT 780 

TTTCTTATCT CTTTTATCCT TGAAAGTGCT ATCTTTTACG AACTCCCCAT ATCTGTAATA 840 

TACAACCTCG CTAAGTTGAA ATCTGAAATC TGATGGTATG TTTACACCCT TTACACCTTT 900 



WO 98/18931 



PCT/US97/19588



1344 

ATACAATATT TCTAATTTGT GTAACAAACC TTGAAACTCT TTATTATCTT TTGGATAAAT 960
TGTAATGAAT TTCCCGACTT GTGAATAACC ATTAAGCCCT GTATTTTGCA AAGAAAGTTC 1020
TTTAATGCTA ACCAAAATTT TGAAATTTAT CTTCTTCTCT CTAGAAAATA TAAAATCAAA 1080
GAATTTTTTA GCAACCAAAT TAGCATTTAA TATTGAAGCG CTCAGGTGTA TTTTAAATCC 1140
CTTAGATTGG GTGATATTAG ACGGCAAATT ATATAACCAA TGTTCATCAC TAAAATTATC 1200
ACTAATTTTA TATTCTAATA ATAAATTATG GTATGCGTCT TCTATTTCAG TTTCATAGTC 1260
CAAATAGTTT AAATACTTTT CGTAATTCAT ATTAAGAAAT CTTCTCCATA AATTTTTAGA 1320
CCATCATTTA AAGCCAAACA ATTTAAAGCG TGATAATAAA TGTTGATAAT CAATGTAACT 1380
TTCAGTCCTC TATTTTGTAA TTCCTTCACC AATAATTTTA TGCTATATCT ATTTTCTCGA 1440
GGCAATTTAT AGGACTTCAA GATAAAACCA TAAAAGAGAT AAGTATTATA ATCTGACAAT 1500
CCAGTTTCAG AATAATTTTT TAGAAAAATA TCTAGTGATT CTGATAATTC ATCCGGAATA 1560
ATTCTTTTAA CATCGTATTT ATTTTTCATA TCGGCCACTC TTCCTTAAAA AGCTCACAAT 1620
AAAATTTTAA ATTTCTATAC AACAATCCGA GAGTAGTCTC ACAATTTGAA CATTTCACAT 1680
CACTCTTAAT ATATAAAAAA TGAATTAATC AGAAACCTCT GACTAACATT TCCTAATTAA 1740
TTCACTTTCT ATATCATAGT AAGGAATTCT ATTATCCCTA ATTGAAAATT GAAATTTTAT 1800
GTTTTATATA TTAACAATTA TGCGGATTCT AAATCTTGTC TAACAAAATG GCAAGTGCTA 1860
CTATGTGCCC CAGAAGGCGA TGCAACGCTA TTTTGAATTG AAAGAGCATA ATCATCCATA 1920
TCATTTAAGT CACGGATTAG CAATGCTTCC TTCTCTCTTC CGACAATTCC AAATTTTCTA 1980
ATTACCTTTT CAGGATTATC AAAAAATTCT CCAACAACTT CCATATTTCC TTGAAGTTCA 2040
TTCAAGAAAG CTTTCATTTG ACTACTCATT ATATAGCTCC TTTTCTATTA CTTTATTTGG 2100
AATCAAAACT TACTTGTACA TTGGAAACAC CTCTATTCTA CGCTTTCATA TTGCTGCATG 2160
ACACTTTCAA AATCAAATTG CTAAAAATAA TTTTTTAAAG CTTAATTTAG ATTTAATTAC 2220
ATATATCTCA AAAAATTGTT TTGAAATTAG TAAATTAAAA TAGGTTTCTG TACTTATAGG 2280
AACTAGTTAT AAAAACTTCG CCCATCATAA AATATCTATT TAAGTAAAAC AAAAATTTTA 2340
TAATTTTTTG ATTTTTAAGT GACTATAATC TCCTATCTAT AAATACCATT CGCAGGACCT 2400
GGATCAATCC CTCTAGCCAT CTTATGAACT TGAGTTCCTC CAGACAGTCC CGG 2453
(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1049 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

CCAATTTGAA GGCTCTAAAA CAATGGAAAA CTGCTACACA GATGTGACAG AATTTGCCAT 60 

TCCAGCAgTA CTCAAAAACT TTACTTATCA CCAGTTTTAG ATGCCTTTAA CAGCCAAATT 120 

ATTGCTTTTA ATCTTTCTTG TTCGCCTAAT TTAGAATAAG TACAAACAAT GTTGGAACAG 180 

GCATTCAAAG AGAAGCACTA TGAGAATACG ATTCTCCATA GTGACCAAGG CTGGCAATAC 240 

CAACACGATT CTTATCATCG GTTCCTAGAG AGTAAGGGAA TTCAAGCATC CATGTCACGC 300 

AAGGGCAACA GCCCAGACAA CGGCATGATG GAATCTTTCT TTGGCATTTT CAAATCGGAG 360 

ATGTTTTATG GTTATGAGAA GAACTTTAGA TCTTTAGAAA ACCTTGAACA AGCTATTGTG 420 

GACTACATTG ATTATTACAA CAACAAGAGA ATTAAGGTAA AGCTAAAAGG ACTTAGCCCT 480 

GTGCAATACA GAACTAAATC CTTCGGATAA ATTAATTGTC TAACTTTTGG GGTGCAGTAC 540 

ATTTTTGGTA TATATAAAAT TTGTAGGAGC TATATCTACA ATTTTATATT CCCAGTTTAT 600 

GGATGTAACT TACTATATTC ACAATGTTAT CCAGTCTTTT TTCTCTAATA TTTAAGGAGT 660 

GTTCTGTTTC TCGAATAAAT TCTTCAAAGT TTAACCCGTC AACTTGTTCC TGAACAAGAA 720 

AATAATCATC CACGATATAA AATTCATCAG TTAAATTAGT AGTATAACTT TTATCGGCTA 780 

ATTTTTTTAG CATGTGAGCT TCATTTTTTA TATCATCAAG AGCTGTCCAT TCTCCTTCAG 840 

CATCATAATT CACAAAAGGT CTTGACTGCT TGATGATTAC TTTTTGCCCG TCCGATTTTC 900 

TAATTGCCCG ATAAACATTT CCTTTATTTG ATCTCTTAAT AATTTTTTCC ATTTTGTATT 960 

TATTTATTGC AGAGTCCTTA CTTGAAACTT CACATGTGGT TTGAAAATAA ATCCTTTTTT 1020 

CTTCTTCTGA AAATAAATCC ATTTTCCGG 1049

(2) INFORMATION FOR SEQ ID NO: 318:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 776 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AGAATCAGAA AATCGCCGAA GTGGTTATTT ATTTTTGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 
CGAAGTATAA TGATATTGCC TTGCTTGCAG ATTTGATGGG ACACGAAAGT ATAGAAACTA 240

CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTGTTA 300 

ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTGCT ATTTTTGTGA TTATGGCTCT 360 

TATTATGGGA ATATACCTAT GAATTGGGTT GTTATAAAAA TAAAAGATAT TTTTTCAATA 420 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAACCATTA ATAATAAAGG TGTTAGAATT 480 

ATACGTGGTG CTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC ATAATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTGGAAAG TTTGCAAGAA TCGAGAAAGA CTATGATGGT 660 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCG AAAGTGCACA GATGATGTCA 720 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 776 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:

TGCAATGCGG CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 120 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGCGCATG 180 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCGATGCCT GCTTGCCGAA TATCATGGTG 240 

GAAAATGGCC GCTTTTCTGG ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT 300 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGCGCTGAC 360 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCCATTCGC AGCGCATCGC CTTCTATCGC 420 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGCAG TTCTACCGGC AGTGCAAATC 540 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCGATCC 658 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1475 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

CCGGCTTAAT TTTTAGAAAA CGTGGGCAGG GAACCTTTGT TCTCTCTCGT GGCAGCTCAA 60 

AAAGAAAATT AATCGTTCCA GAAACAGATA TCCGGGGACT GACAAAAATA TCTGAAGATG 120 

CTCATTCTAC AATTGACTCG AGGATTATTC ACTTCAAATT AGAATTTGCA AATGAATTTT 180

TAGCAGAAAA ACTACAGGTC GCTTTGCAGA GTCCAGTTTA TAATATTTAC CGCCTGCGTA 240 

TTATTGACGG TAAACCTTAT GTTCTGGAAC AAACTTATAT GAGTACCGAT GTTATTCCAG 300 

GTATTACTGA AGATATTTTA CAAAAATCGA TTTACAATTA CATTGAAGGA AAGTTAGGAT 360 

TGCATATTGC CAGTGCTACA AAAATCTTAC GAGCTTCTTC TAGTTCAGAA AATGAGCAAC 420 

ATTACTTGCA GCTCCTTCCA ACGGAACCGG TATTTGAAGT AGAACAAGTG GCTTATTTGG 480 

ATAACGGAAC TCCGTTTGAG TACTCGATTA GTCGTCATCG CTATGATTTA TTTGAATTTA 540 

ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC CAATCTTTTA 600 

CAGACTCTAG TTTAAGAAAA ATTTAAAACA GGGCAAGAAG GTCCCATCTA TGCTTAAATG 660 

GTTTCTCTTT TCTAAATAAG ATGGCTTTAA AAGAGTGATC GTTGTATCCA TCATGTTGAA 720 

AAATATCTTC GTATAGCTTA TAGAGTAGGT ACTGAAATTG TTCACCTGAT CTACTTCTTA 780 

TAGTTATTTA GTTTTAAATA GTGTTTCAAA CATTCTTACA CTGACGAGAA GTTTTTGAGT 840 

CTTTTCTTGT AACACATATA GTATACTGTG GTTAGAATAG TAGACTGTGA CTTCTAACAA 900 

ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTATTTGTTC ATATCTTCTT TTAATATATT 960 

AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT TTTTATTTTT CATTACGAAT 1020 

AATATAGATG AAGGGGAAAG AGTATGAAAA CAGAACTGTT TCTTTTGCTA TTAGTTCAAA 1080 

AGGAGAAAAA ATGAAAGTAG AAAATATTTC GTATAGGGTG GATCATCGTA AATTGTTTGA 1140 

TAATATTTCT TTTGATACTT CGAGTTCAGA CGTGACATTA ATTACTGGTA AAAATGGTAC 1200 

AGGAAAGTCA ACTTTACTAT AGTAGATTGA AACTAGAATA GTACACATCT ACTTCTAAAA 1260 

TATTGTTAGA AATCGATTTG ACTATCCTGA TCTATTTGTC CTGTTCTTAT TTCATTTCAC 1320 

TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATGTC CTGGGAACGC ACTTTCTTCA 1380 

TATTTTTCAT ATTCTTGAAT CCATCGATAA AGACTATTGG GATGAATTTT TAAAGTTGAA 1440 

CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 1475 
(2) INFORMATION FOR SEQ ID NO: 321: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

GAAATATATA TACTTCATCT TAATACTGAG CAAGCTAAAC TTAGCATTTC ATGCCCTCAT 60 

ATGGGATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 120 

TAAGTCTGAC GCGTAGACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 180 

TGTTCAAATC CCAACACATT ACCTGAGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 240

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 300 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC AGCTCTTCTT 360 

TGCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 420

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTAGAAAAAT CATAACCTTC CCTATCTACG 480 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGGGA AAGGAAGATA AGAAaCGTAT 540 

TTAGCCCATA ATCTATAAAG 560 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTCACC TGGAGTGAAC CTAGAACGAT 60 

AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 120 

CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 180 

GCCTGAGGAA CTACGGGTGG TGGCTTTTCC GAAGAACGGC TCCGGGTTTG ATGACGAGGG 240 

TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 300 

TCGGGCGACT GGAGGAATGG CGCCTTTTTC CGGTTTGACC AAAGTGGCAT CACATGGTGT 360 

CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 420 

TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 480 

TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGCGTA GGCAGCCGAG AAGAACTCTT 540 
ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTGGCCA GCTTATTGTC 600
GCATTTAGAC GGTCAAACGG AACGAGCACG TAGGGTACTG TGC 643 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

GGTACCCACT CATTCTTGAT GAATTGTGAA CAGTTGCCCT TGGGTCGTTT TGCGACTTGA 60 

AGTCAAGAAG AGGAAAAAAA CAAAAAGGAG AAATACTCAT GGCAGTAATT TCAATGAAAC 120 

AACTTCTTGA GGCTGGTGTA CACTTTGGTC ACCAAACTCG TCGCTGGAAT CCTAAGATGG 180 

CTAAGTACAT CTTTACTGAA CGTAACGGAA TCCACGTTAT CGACTTGCAA CAAACTGTAA 240 

AATACGCTGA CCAAGCATAC GACTTCATGC GTGATGCAGC AGCTAACGAT GCAGTTGTAT 300 

TGTTCGTTGG TACTAAGAAA CAAGCAGCTG ATGCAGTTGC TGAAGAAGCA GTACGTTCAG 360

GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT TACAAACTGG GGAACAATCC 420 

AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTATGGA AGAAGATGGA ACTTTCGAAG 480 

TTCTTCCTAA GAAAGAAGTT GCACTTCTTA ACAAACAACG TGCGCGTCTT GAAAAATTCT 540 

TGGGCGGTAT CGAAGATATG CCTCGTATCC CAGATGTGAT GTACGTAtTG ACCCACATAA 600 

AGAGCAAATC GCTGTTAAAG AAGCTAAAAA ATTGGGAATC CCAGTTGTAG CGATGGTTGA 660 

CACCAATACT GATCCAGATG ATATCGATGT AATCATCCCA GCTAACGATG ACGCTATCCG 720 

TGCTGTTAAA TTGATCACAG CTAAATTGGC TGACGCTATT ATCGAAGGAC GTCAAGGTGT 780 
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 60

AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 120

AGGAAAAGCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATGAACGGA GCAAGGTGTT 180

TGACTTTTCC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 240 

GACTACTTAT CAGTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAAGGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCGTAT CTCTCCCTAA 360 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGGACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 480 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCATgAAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCAGTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 624 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

TCTTATGAAG CCGAAGCGTG ATTTATGGCG GATAGGTTTG GTCTGCAGAA AGTGACAAAT 60 

CTAGTGCCAT CAGCGTATAT GGAATCTnTG GCTGAGAAAC AGTCCCGGGG TGAACTGACT 120 

TATGAGCAGG TTTATGAGGA TGCAACGGCT TATCATCATA CCATTGATGC GAGTACAGAG 180 

GAGGCAGACT TGGTTTCTCT ACGTATTGTA GAACTATTGT CTCGAAGAGG CTTTAGCTTC 240 

AGTCCTGCGA TCTTACTTGC TATTCATAAG GAGTTGTTTC AAGATATATT TGAACCCTCG 300 

ATTCCGGTAG GTCAATTTCG TCAGACTAAT ATCACAAAGA ATGAACCTGT TTTGAATGGT 360 

GAAAGTGTTG TGTACTCTGA TTACTCCATG ATTCAAATGA CCTTGGATTA TGATTTTAAT 420 

CAGGAAAAAC AAGTTGCATA TGCGACACTA ACCCAGGCGG ATATGGTTAA AAAAATCCAG 480 

CATTTTATTT CAGGAATCTG GCAGATTCAT CCATTTCGCG AAGGAAACAC TCGGACGGTA 540 

ACGGTATTTT TGATTCAGTA TCTTCGTGAG TTTGGTTTTG ATATTGATAA TACACCATTT 600 

CAGCAACATT CCAAGTATTT TCGTGATGCC TTAGTGTTAG ATAATGCAAA GATTTTACAG 660 

CGACGTCCTG AGTTTTTAAC AGCTTTTTTT GAAAATCTCT TGCTCGGTGG TCAAAATGAT 720 

TTGTCTTCAG AAAAAATGTA TCTAGATTTA GACCTCGATC TTTCATAATC CTAATACTGA 780 

GTAAACATTG AATTTTAGGA AAAAATGAAG TAAATATTCT CACAAGAAAA CGTATATCAT 840 

CAAAGTTTGG CTCTTTGTCA ATTGTAGTGG GTTGAAGAAA AGCTAAGTTC GAGAAAGGGC 900 
AAATTTCGGC CTTTCCTTTT TGATGTTCAG AGCGATAAAA ATCCGGTTTT TTGAAGTTTT 960

CAAAGTTTCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAT AACCTTTTCT TTATCTTTGA 1080

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTG CTTAAGATTG TCCTCGATAA 1140

GTTCGAAAAA TTTCTCCGGG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200

GATAGTGATG TATCAAGTCT TGTGAATAGC TCAAAAG 1237
(2) INFORMATION FOR SEQ ID NO: 326:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 461 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTGCAT CTCTAGAATA TATTGGAACA 60

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180

AAATCGGTTA AAACATTAGG AAGTAATCTA TTTAGACTCA CTACTAGCTT AAAACATGTT 240

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTTTGTT TTCAAAAGAT 300

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 360

GAGACAAAAG AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 420

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 461

(2) INFORMATION FOR SEQ ID NO: 327:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA CAATTAATAT TTTAAACAAT 60

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 120

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180

ATAGAATTCA TAAAAGCCAT CGCTCATTAA AACAATTTGT TCACTAGTAA CATCTATTTG 240

ATTAATAATA GCATGGTCTA AAAATCTCTC ATCCAACGAA CCTATCCAGT ACCCACTCGG 300 

TTGATTAGAT AATTTTCTGA TTTTTTGTAA AATAATTTTT TTATTTAAAA CACTATTTGT 360 

ACCAATTGAA TCTTTTATCT CATTTTTCCC TTTTTCAAAT AAGTTATCTA CTCTATGATC 420 

AGTTATTTCC ATTTCGTTTA CTAACATGAC GCAGTCACCT AGCATCATAT ACTCCAACTT 480 

TTTTTCTGAA AGTTTAGCAA ATATTGGTAA CCGATAATAT AGTATATTGA AACTAGAATA 540 

GTACACCTCT ACTTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC 600 

CTATTATTAT TTCATTTTAC TATACTCTGT TAATTTATAT GAGTTTAAAC CGATTTCATC 660 

TTTAACCTCG AGTAAAGCAG TTTCAAATAT TTGTTTAAGA GTTTTTGATT CTTTACAATT 720 

AACCGACAAA CTTTCTGATA AAATATGTAC AACTTCTGAG ACTGAATAAC CTATCTCCTC 780 

TTTAGAATTA TATAAATCTG TAGCTCCACC AATAATCCAA AAATACTGAT TTTGTGAACC 840

TACAATATCC TCATTTTCTA CGGAACTTCC TTGTATCGAA CAAATTTTAT TTATCTTTAC 900 

CATAATACTT CAACCCTTTT AGTGTCAAAA GTAAACCAAT TCCTGTCACT GTTAAGAATA 960 

GTTCCATAAT CTTATTCGAA CCAGTCTTTG GTAATTTTTG TTTkACATCT ACTATyTCTT 1020 

TAGATTTATT AATATGATTT TCAGTTTCTC TGCCATCTCC AACTATTTTA TAGTTTACTT 1080 

CTTCTGTCTT ATTATCTTGT TTATTGTCGA TCTTGTCATT CATTTGTCTA TTATCTTTAC 1140 

TTGAGTTAAA CTCTCCGTTC TTCTGGTTAC TATCAATTAC ATTATTTGAA TTAGATTGTT 1200 

TTTCCTCTTT GTTTTTTTCT TTTTCGTTTT TATCACTTAA ATTATTTGTT ACAATTTTGT 1260 

AAAGCCCATT CTCCGTTACA ATATTGAAAT TACCATCGCT ATCACGTATA ACAGGTTCTT 1320 

TCCCATTTGC ATTAGATTTG ATGAATGATA TATACTTACC GGATAAATTA TAAAATTGGT 1380 

TATTTAAAAC GGTTATTTTA CCCTTTGAAT CCTCAATAAC AATTCCTTCT TTACCC 1436 
(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
CCGGCAGACA GGAGAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 60

ACGTATTTTC CGTCTTTCTT CCGTTGCTTG ACACACTCTG TGAGGAGATA TTCGATTTGC 120

CCATTGACTG AACGAAAGTC GTCTTCTGCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 180

GAGAGTCGAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AGGCTTCCTG 240

TGTTGACAAT TGGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 300 

TGGCAGCTTT TCGTTCTTCG TCAAGTTCTA CCAATTCCCC TTCATTGAGC CGTTCTAGTG 360 

CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 420 

ATGCTTGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTGA 480

TACGTGCTTC AAGGATTTCC AAGCCAGCAT CCTCAACACG ACTTTGGATT TCTTCACGAA 540 

TACGGGTAGC AACAATTTCG CTAGAGCCAC GGAGACTACC TTCATCTGCG TCCCCATCAC 600 

CCGGAGTATC CACATTAGGA GACACATCGT AAGGATAGAT GCGGAC 646 
(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1653 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

GTTGCAGGTG CAGTAGGTGT TACTTCAGAT ACATTTGAAC GTGCAGAGGC TCTTTTTGAG 60

GCAGGAGCGG ATGCGATTGT TATTGATACT GCACATGGTC ATTCTCCAGG TGTCTTGCGT 120

AAAATTGCCG AGATTCCTGC TCATTTCCCA GATCGGACTT TGATTGCTGG AAATATTGCT 180

ACTGCTGAAG GTGCACGTGC CCTTTATGAA GCCGGTGTAG ACGTTGTTAA GGTTGGTATT 240

GGACCAGGTT CTATCTGTAC TACTCGTGTG ATTGCTGCTG TTGGTGTTCC GCAAGTAACA 300

GCTATCTACG ATGCTGCAGC TGTTGCGCGC GAATATGGTA AAACGATTAT TGCTGACGGT 360

GGGATCAAGT ATTCTGGAGA TATTGTAAAA GCACTTGCTG CAGGTGGAAA TGCTGTTATG 420

CTTGGATCTA TGTTTGCTGG AACTGATGAA GCTCCAGGCG AAACTGAAAT CTTCCAAGGA 480

CGTAAATTCA AGACTTACCG TGGTATGGGA TCAATTGCTG CTATGAAGAA AGGTTCAAGC 540

GACCGTTATT TCCAAGGTTC TGTCAATGAA GCAAACAAGC TTGTTCCAGA AGGAATTGAA 600

GGTCGTGTTG CTTATAAAGG AGCGGCAGCT GATATTGTTT TCCAAATGAT TGGTGGTATT 660

CGCTCTGGTA TGGGTTACTG TGGTGCAGCT AACCTTAAAG AACTACACGA TAATGCTCAA 720

TTTATTGAAA TGTCTGGTGC TGGTTTGAAA GAAAGCCATC CTCATGATGT GCAAATTACT 780

AATGAGGCAC CAAATTATTC TATGTAAAAA ACAATGAAAA GAACTCCAGT GAAAACAGGA 840

GTTCl"rTTAC AATGTTGTCA ATTTCCATTT ACAGCAGCTT TACCATCCTG AATAGTGAAG 900

ATACTTAGAT TTTCTGGCAG ATTTTGAAGA TGGTCTAAGC l T GTl ' GTr GT GATAAAGGTT 960

TGGATTGATT GAGAAATCGT TTCTAATAAT TTTAACTGTC TAGT6TTGTC AAGTTCACTC 1020 

ATCACATCGT CAAGCAGTAA TATAGGAGAT TCTGTGGTAA TGCTTTCCAT TAATTCGATT 1080 

TCTGCTAATT TTATCGAGAG GACGAGACTA CGATGTTGAC CTTGGCTTCC GAAACTAGCA 1140 

TCCATCCCAT TTATATAAAA AGAAATGTCA TCTCGATGAG GACCGACACC AGTATTCTTT 1200 

TTAAATAAAT CTCTGGATCT ACTTTTTTCT AAAGCAATTT TCAAAGATTC GGATAAGTTT 1260 

TGTTTGTCAG TTATATTGAC AGAAGATTGA TAGGATATTG ACAACTCTTC GATCTGATTA 1320 

GAGAGTTCAA AATGTTTCTT ACGCCCAAAT GATTCTAGTT TTTTTATGAA ATCTAAGCGG 1380 

TGATTCATTA CACGACATCC ATAATCAACT AGCTGATCAT CTAACACAGA AAGGAATGTT 1440 

TCATCTATTT TTTGAGCTGA TTTTAGGTAA GTGTTTCTTT GCTTTAGGAT GTGGTTATAA 1500 

TTGGTTAAGT CAGATAAATA GATTGGCTTA ATTTGCCCAA GTTCCATATC AATGAATTTT 1560 

CGTCGAATCG AAGGTGCTCC TTTAATTAGT TGTAAATCTT CAGGAGCAAA TAAGACAACA 1620 

TTCATGTGTC CTACATAATC TGAAAGGCGT GCC 1653 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT AGTTTAAAAT TACTCCCATT CTTCTTTTCC 60 

AAACGTACAA TATATCCAAA ACCATTCAAA ATACTAGATT CTATTTTTTA TAATATCACT 120 

AAATCCACCT AATTATAGGA CGTTTTCAGA TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 240 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 300 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 360 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TCAGACTATA 420 

TATGCACATA TACACTTAAA TTCATATAAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTGGAAGT 540 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAAGAA GAGTTTCAGC TACATTCATT 660 
CTTTTTCTTT GACTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 720

AATTTAGTAC CACTCATTTT AGATATTTTT TCAAGCGTGC CTTGATTTAC ATTCGTTGCA 780 

AAATGATCAT CTAAGAATCT TGCTACATCT TCAAGTGCTT TATCATCATC AATTTCAATT 840

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 900 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTCCC 960 

ACTCTTTCTA AGGCCTTTGT AAGTATTATT TGATTCGGTT GAAGCAAGGT TGAATAAAAA 1020 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 1080 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 1140 

TTATCAAAAT CAAAAGTACA TAGAGAGTTT GCGGTAATAC TTTGATACGG ATTAAACTTT 1200 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CATAGTCTGA CATACTATAA 1260 

GTGCTATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCGAT AATGAAGATG 1320 

CCTCCTTCAT AAAACCGGTA 1340 

(2) INFORMATION FOR SEQ ID NO: 331:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 607 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTGATCC GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TTCTTTTTAT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATGATG AACGTCTTCA ATTGGAATAT 240 

AGTGATTTAC GAAATCGTAT TTTAGCTGAA ACTCCACAAA GTTCTTATTC TTGTTTCCAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 360 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGGAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 480 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 

ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 600 

CGGGAAA 607 
(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

TTAAAATACC GAATTTTGTT TTGTCCTCTA TTTCAACATT GTGAATCGCC TCAGGCAGAG 60 

AACCGATACT AAAGATATAA CCAAAATAGT TGTCATTTGC TTTACCGATA TCAATCTTAT 120 

TGGTTAAATC AAAATCCAGT TCGTCAATTG CGCCATCGAT GTCTTGATTG ATTTCCAAAA 180 

GTTTTGTAAT GAGGTTACCC GTACCGCCTG GCATAATCCC TAACTTAGGA ATGTAGTCTC 240 

TCTCATCAAT ACCTGAAATG ACTTCATTGA CACTTCCATC TCCACCAAAC ACAACCACTG 300 

CATCATACTG CTCACGAGAA GCTTCTTCAG CAAAATGTGT TGCATCCAGC GCTTTTTCGG 360 

TAATTTTGGT TTCAACATAT TCAAAGTATT CTTTTGCTTT ATTCTCCAGC TTTTCTTTGT 420

AATCCAAAGC CTTCTCGCCA CCAGAAGTAG GGTTGATAAT TACCATTGCT TTTTTCATTG 480

ATTTTATCCT TAATTTTAAA CAGAAATGTT TACATTTCGT CGTATGCAAG TAAATGTAAT 540 

CCTATTATAC AATGAAAATA CAGAAAAGAG AAATCTGACG TACTGGAGAT TAATACGCTT 600 

TTATTCTATT TTCCCATCGC CTAACTACAT CCTTTAAGGG TTCATCCAAG TAAGAATAGG 660

CCTTATCCTT GATCCAATCA GGAATACCGT AACCTGCCTC TGCTAwGCTA CAAGTGATTG 720 

CTGCGAGAGT ATCACTGTCG CCACCAAGTG AGATGGCATT TCTTATCGCA TCTTCGAAGT 780 

CTCTACTTTC AAGAAAGGCG ATAATGGCTT GAGGGACAGT TTCCTGACAT GTTTCGTTAA 840 

AACGATAGTT AGGACGGATT TCATCTAAAG TTTGAGATAG ATTGTAATCG TATTCTTTTT 900 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
GGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCCC TGCTGAGTTC 240

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTCG CCTACACTCT CAGAAACAAA 300 

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 360 

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGCAGACCG TAGAAATGGC 420 

AGAGCTGCTG AGCAAGCGTG GCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 480 

GAAGGTAGTG GTGTCAGCTG TCCTCTACAG CATGCCTATG ACTGGTGGCC TTC 533 

(2) INFORMATION FOR SEQ ID NO: 334:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 
CCAGCAAACT AGGAAGCTAG CCGTAGTTGC TCAAAGCACA GCTTTGAGGT TGTAGATAAG 60 
ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 
TCAAAACACT GTTTTGAGGT TGCAGATAGA ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 
AGGTTGCAGA TAGAACTGAC GAAGTCAnnA ACCACACCTA CGGCAAAGTG AATCTGAAGT 240 
GGTTTGAAGA GACTACAACT TCTCTTTTAG AAAAGGAGCC TATAATGAAA GTCTTTCAGC 300 
ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 360 
TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGCGTTTTTA GAGCAAGCTG 420 
AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 480 
ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 540 
ATGA 544 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
CCAGGAACTC AAATGTAAGT AGGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGCCTCTA 60

CACTATTTGT AGCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA GTTAGAATAT 120

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGGTATATT CAAAATACAA 180

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 240

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 300

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGGATACC TATCCTACC 349

(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1206 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

CTCCGATAAC CACACCAGCA ATGGAAATAA TTCCATCGTT AGCATCAAGA ACACCCGCAC 60 

GCAGGATATT TAAACGACCT GCAAAATTTG AATCAATTTC GTGATTTGTT TCTGACGCTA 120 

AATTTCAAGT TCAAGTTAGC CATCAAGAAG TCTTCTCTGG GTGACTTGTA GTCCAAGCAT 180 

TTTTTAGGAT AGTTGTTAAT CCACTTTTCG ATGAATGCGA CTTCTTTGGG AGTCATTTTC 240 

TTGGTTCCCT TAGGTAACCA TCTACGAATG AGCCTGTTGT GATTCTCATT AGTTCCCGGG 300 

ATCCTCTAGA GTCGACCTGC AGGCATGCAA GCTTGGCACT GGCCGTCGTT TTACAACGTC 360 

GATGACTGGG GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT 420 

CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 480 

CCTGAATGGC GAATGGGGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC 540 

ACACCGCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC 600 

CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 660 

TTACAGACAA GCTGTCACCG TCTCCGGGAG CTGCATGTGT CAGAAGTTTT CACCGTCATC 720

ACCGAAACGC GCGAAACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT 780 

GATAAGGATG GTTTCTTAGA CGTCAAGTGG CACTTATCGG GGAAATGTGC GCCGAGACCC 840 

TATTTGTTTA TTTGTCTAAA TACATTCAAA TATGTATCCG CTCGTGAGAA AATAAACCTG 900 

ATAAATGCGT CAATAATATT GAAAAATGAA GAGTATGAGT ATTCTACATT TCCGTGTCGC 960 

CCTTATACCC TTTTTTGCGG CATGTTGCCT TCCTGTTTTT GCTCACCCAG AAAACGCTGG 1020

TGAAAGTTTA AGATGCTGAA AAATCATTTG GGTGCACAAC TGGGGTTACA TCCAACTGGA 1080 

ATCTCCAnCA GCAGTTAAGA TCCTCTGACA GTTGTACACG CCGCAAGAAC TATTCCCGAT 1140 
CAATGAGCAA CTTTTAAAAG TCCTGCGAAT GTTGGGGCGG TAATAATCCC CGTGTTGTAG 1200

GCCCGG 1206

(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337:

CTGCTCAACT CAGACAGTCA AATTTCTGAC TTTACCAAAA GAACCATCAA AAAAGTTGCT 60 

GAAAAAGGCC ATCACGTTAT TATTACGACA GGTCGCCCTT ACCGTATGTC AAAAGATTTT 120 

TACCGTGAAC TGGGCTTAGA CACTCCTATG ATTAACTTCA ACGGATCCCT TACTCATTTA 180 

CCAGACCAAG TTTGGGATTT TGAAAAGTGT TTGACTGTAG ACAAAAAATA TCTGCTAGAT 240 

ATGGTTCAAC GTTCAGAGGA CATTCAAGCC GATTTTATCG CTGGAGAATA TCGTAAAAAA 300 

TTCTACATTA CAAATCCCAA TGAAGAAATT GCCAATCCCA AACTATTTGG TGTAGAACCT 360 

TTCCAGCCTG AAGATCAATT CCAGCCTGAA TTGGTGACCA AGGACCCTAA CTGTATCCTC 420 

TTGCAGACTA GAGCCAGTGA CAAATATTCC TTGGCAAAAG AAATGAACGC CTTCTACCAC 480 

CATCAACTTT CTATCAATAC CTGGGGAGGT CCGCTCAATA TCCTTGAATG TACCCCAAAA 540 

GGTGTCAACA AGGCCTTTGC TTTGGACTAC TTGCTCAAGA TAATGAATCG TGACAAAAAA 600 

GATTTGATTG CCTTTGGAGA TGAACACAAT GATACCGAAA TGCTCGCTTT TGCTGGGAAG 660 

GGTTATGCCA TCAAAAATGC CAATCCAGAG CTACTCCCTT ATGCAGATGA GCAAATTTCC 720 

CTTACCAACG ACCAAGATGG GGTTCCCAAA ACCCTACAAG ACTTATTCTT ATAACCTATA 780 

CTGATACTCA ATGAGGGGCA AAGAGCGAAC TTA 813 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 683 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
CCTAGATAAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 
ACAAATTATT TAAACTAAAA GAGAACGGTA CAGACGTTCG TACAGAGGTT CTCGCTGGTT 120

TAACAACTTT CTTTGCAATG AGCTATATTC TCTTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAG CGACGATTAT TGGTGCAGTA GCGGGTACCT 240 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAAGC GCCAGGTATG GGACTCAATG 300 

CCTTCTTTAC CTTTACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CAGCTATTTC AGCTGGTATC GGTGTCTTCC 480 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACGATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGAGAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACTCTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

CCATTACTAT CTTCTTTGTC ATC 683
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

CTACTTTACA TGGAAGTAGT CACTGAATTC CAGTTAGAAA TTACTTTGTA ACTACGTTTT 60 

GAGGAGGAGT AAAATGCTTT CCTACGTTCG ATATTACCCA CTAGCGATAG CTAAATTAAT 120 

GTGTCTGTGC TCTCCTAAAA TCTGCTGATT TATTACTGAC TAATACAGGA GGTTTTTTTT 180 

ATGgACAGAC AATCATATCT GCTATTGGTG TTTATATTTC CACCAGTATC GATTATTTAA 240 

TTATTTTAAT TATTTTATTT GCACAGCTAT CACAGAATAA ACAGAAATGG CATATTTATG 300

CGGGGCAATA TCTAGGCACA GGCTTACTTG TAGGGGCGAG TTTAGTTGCT GCTTATGTCG 360 

TTAATTTCGT GCCTGAAGAA TGGATGGTTG GATTGCTTGG TTTAATCCCT ATCTATTTAG 420 

GGATTCGCTT TGCAATTGTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA ATTATTGAAA 480 

GATTAGAACA AAGCAAGGCA AATCAACTGT TTTGGACAGT TACATTGCTG ACAATTGCGT 540 

CTGGCGGAGA TAATTTAGGT ATCTATATAC CTTATTTTGC TTCGTTAGAT TGGTCACAGA 600 

CCCTCGTGGC CTTGCTTGTG TTTGTAATCG GCATAATTAT CTTTTGCCAG ATTAGTCGGG 660

TGTTATCCTC TATTCCGTTA ATATTCGAGA CAATTGAAAA ATACGAGCGA ATCATTGTGC 720 

CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG ATAGAGACTT 780 



TTCTGATCGT GTAGATTTTT TTGTTTCACT AGGCATTTAG CCCGAGCTCA AATCAGCTCT 840

CTGATTTTCA GA 852

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 754 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
CCGCACAAAA GCGCATAGTA TCAAGATTCT ATAAAGCCTT GATACTATGC CTTTTTAATG 60

GATAAATAGT TAGTCTTTTT TAAAGACCGG ATCTTTCAAA CTCTGCATAC TGGCATTGAT 120 

CACCGCGCCT AGGATAACAA TTTTAGCAAT CAAGATAAAC CAAAACATCA TAACAACAAG 180 

AAGAACGGAA CCTAAAATTC GGACATCCAC CAAATGATGG ACATAGTAAT TGAGATAACT 240 

AGAGAACAGA GTTAGTAAAC CTAAAATCAC TAAGAGAACA AAGGCACTGC CTGGTAGGGT 300 

ATAGCTAATT TTCCTGTTAG ATAGATTGGG AAGAAAATAA TAAAGCATGA CCAAGATACC 360 

AAAGAGGAGG GCGTAAATCA GAGGACCTGC CAACCCTTGT AAAGCCTGAT AGATAATGCC 420 

ATCTTTTGTC CAATAATGAG CAAGTAAAGC CAAAATCATC TGACCAAATA AGATCAAAAA 480 

CAAGGCAAAC GCAAAGAGGA GCTGCAACCA AAACTGACTA GGAGACTTAG CATCTGATGG 540 

GAAATAAGTC CACGACTCTT TTCGACGCCA TAACCCTTGT TAAAAGCTTT TTGCAAGAAA 600 

TTCATAGATT TTGAAAAACT CCATAACGCC GATAAAACAG AAAAACTCAA TAAACCTGTT 660 

GAAGGTTGCG TCAACACTTC TCTGGCTATT TTTTCCACAC CTTCATAGAG GCTTGGGGGG 720 

CAGACGTCTT TCATAAAGCC CAAAAATTCT CCCA 754 
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341:

GGGGATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG AGTGCACAAG AAGTCAGGAT 60

TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 120

CAAGGTTTGC TTTATTATTT GCTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 240 

ACAGAGCAAG GAGACCAAGC TGGACATCTG GTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTGCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 360 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 420 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 480

GTGAGTTTAG GCTTGTTGGT CTTTAACCTT GGGCATAGAC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGGCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAAGTTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACACTATT 660 

CGCTTTGATT TTACTTCTAA AATTTTAGAG AATATGTTTT TTTAAGG 707 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

GCATTTTGAA AAACCATACC GATTTGACGA CGTATATTCC AAACATTTTC CTCAGTCAAA 60 

CGTTGGCCAT CAATTACAAT CTCTCCGGAT TCTGCTTCCA GTAAGCCATC AATTAATCGA 120 

ACCGTCGTTG ATTTACCACT ACCATTATGC CCTACAATCG AAAGCCATTC TCCACGTTTC 180 

ACGTGAAAgT AATATCCTTC ACATCGTAGT AGTTCTCATT TTCTTTATAG CGAAAAGAAA 240 

GATTTTTTAC ATCAATTATT GATTTCATTT CGAACCAAAT GTCCCTTTAA ATACATAGGC 300

ACTACCCTTG AAATAGTCAT AGCCAGAGTA GATAGTGAAA AATAAGGCTA CATAAAGTAG 360 

AACTTGACCA AGCAAAGTCC AATGTAATAG CAAGAAAATA ATGGCAAACA TCTGACTAAA 420 

AGTTTTAATT TTTCCAGGCA TTGCTGCTGC TAAAATTGTT CCACCAGTTT CAACCAATAA 480 

AAGCCTTAAA CCTGTCACAG CTAACTCACG ACAGATAATC ACTGCAACAA TCCAAGCCGG 540 

AGCCATACCT AACTCAATCA ACATAATAAA AGCCGACATA ACTAGTAACT TATCCGCCAT 600 

AGGATCTGCA AATTTACCAA AATTACTGAC CACATTCCAT TTACGAGCTA AATATCCATC 660 

TAAATAGTCG GTAATACTGG CAACAGCAAA GATAATAGCT GCAACTATAT GACTCTCTAT 720 

CGAATTTCCT ATCGTTAAAA TAAAGATAAA AATAGGTATA AA 762 
(2) INFORMATION FOR SEQ ID NO: 343: 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 482 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

CTTTTGATAC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTCAGC GTTTTGCTTC 60 

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 120 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 180 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 240 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 300 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTCTC GCAAACTACA 360

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAAGATT CAGTCCAAAT 420 

CGCGGCTAAA AGTGAACCCA ACTGGGCAGT CAATTGGTTC GTTGGCGACT TGGCACGACT 480 



(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 120 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 240 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 300 

CTACACTTAA AGATTATCGA TTTAGCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 360 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 420 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGCGT 520 
(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:

GCATCAAATC CGCCATCAAA GAAGTTCTCT GGATTTACCA AGACCAGTCA AATAGCTTAG 60 

AAGTGCTTAA TGACAAGTAC AATGTTCACT ACTGGAATGA CTGGGAAGTT GGAGACACGG 120 

GAACCATTGG TGAGCGCTAT GGTGCCGTTG TTAAGAAACA CGACATTATC AATAAGCTTC 180 

TCAAACAGTT GGAAACCAAT CCTTGGAACC GCCGCAATAT TATTTCGCTC TGGGATTACC 240 

AAGCTTTCGA AGAAACAGAT GGGCTGCTCC CGTGCGCCTT TCAGACCATG TTTGATGTTC 300 

GGCGTGTTGA TGGGGAAATC TATCTGGATG CGACCTTGAC CCAGCGCTCC AATGATATGC 360 

TGGTGGCCCA CCACATCAAC GCTATGCAGT ATGTGGCTTT GCAGATGATG ATTGCCAAAC 420 

ATTTTGGCTG GAAGGTTGGG AAGTTCTTCT ACTTCATCAA CAACCTCCAT ATCTATGATA 480 

ATCAATTTGA ACAAGCTCAG GAATTGCTCC GTCGGGAgCC GTCAAACTCC CAACCACGCT 540 

TGGTTTTAAA TGTTCCTGAT GGGACTAATT TCTTTGATAT CAAAGCAGAA GATTTTGAGT 600 

TGGTGGATTA TGACCCTGTT AAGCCACAGT TGAAGTTTGA CCTAGCTATT TAAAAGAATA 660 

GAAAAAAGAA GTTGAGAATA ATCCCAACTT CTTTTGTTTC TTAACGTGAT ACGCGGCGAC 720 

GAGCTGCTTT TTTACGGTTT TCTTCGATGA AAGCTGCTTT TTGCTCTTCT GGTTCGATTA 780 

CTTTCTTTTT AAATGCGTAT ACTGCACCTG CAACGGCAGC GACAGTTCCT GCGACACCTG 840 

TTACAAGACC TTTAGCGAAT CCTTTAGCCA TGAGTCTTCC TCCTTTATAT TCTCAATCAG 900 

CCAGCCTCCT CAAGAGGTCA CATTTTTCTG ACTGACCTTT TTGTGTTATA ATAATAGTAA 960 

CGAAAAAATG GGAATTTTTC AAGGAAAAAA GATGAGAACA AAA 1003 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGGC GCAACTGGTC 60 
TCAACGCCTT TGAACGCCTC TTGGTCGGCT CTTCATCTGA ATACATACTC CCCCATGCTA 120

AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGGAG 180 

CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCTCTTTA TGGCGTTCGT AAGCCTTGAG 240 

CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGGACCA TATTTTTCTT CCCAATTATC 300 

TGGTTTTAAG ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 360 

CCCCATATTG GCCTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAAGAC 420 

AAAACTCCCG TAGGTGAAGT AAAGCGTGTC AATCAAGGCA TCCACTTGCC CTATCAAATC 480 

TTGCTGAGCA GGTGTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATGAAG 540 

TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 

CAATTCTTCT ATTTTAAAAC CAGCCCTATG CGTTGCACCC TCTAAATCCC AAGCTCGAGG 660 

TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTGACCTTAT TGAAATGATA 720 

CTCACGGCTG ACAAAGACTT TTTCTGAAGA 750 

(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 596 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAAGCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 240 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 300 

TACGAAGAAG GTCGATAGAT GTCAkTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 360 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 420 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 480 

TTTGAGCCAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATCAGC 540 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:

CAGAGTCAAC AGCCTGAGTT GAAGGCAACT TTAGACACAG CAGTTACGAC AGCTGAATGA 60 

GCTCCTCCAT CAGTTTTTTC TTTAATGAGT CCAGCTACAT CTTCAACTTC GAGGCCGTTA 120 

ATCACAATGT CAGCGCCTAC TTCTTTTGCA AGGGCAAGTT TGTCATTGTT GATATCGACT 180 

GCGATAACAT GAGCATTGAA TACTTTTTTA GCGTATTGAA CAGCGAGGTT ACCAAGTCCA 240 

CCAGCACCGT AAAGAACAAC CCATTGGCCT GGTTCAACTT 'ITCC'lTC m ' GATAGCTTTA 300 

TAGGTTGTTA CTCCAGCACA TGTGATAGAA GAAGCTTGGG CTGGATCAAG TCCGTCAGGA 360 

ACTTTGACAG CATAGTCAGC AGTTACGATA CATTGTTCAG CCATACCACC GTCTACTGAG 420 

TAGCCAGCAT TTTTCACTGT ACGGCAAAGG GTTTCGCGAC CAGTTGTACA GTATTCGCAA 480 

GTGCCACATC CTTCAAAGAA CCAAGCAACG CTGACGCGGT CACCGACTTT AAGGCTTTTC 540 

ACATCTGGAG CAATCTCTTT AACGATACCG ATACCTTCGT GCCCAAGAAC ACGTCCTGGG 600 

ACTTGACCAA AGTCACCATC AGCAACGTGG AGGTCGGTGT GGCAAACGCC CACAGTATTC 660 

ACTTCTACAA GTG 673 
(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGGTGGTC AATGGAACTA TGGAGTAGGT 60 

TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGATATCA TACAGCAACT 120 

GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 180 

TCCCTCACCA AGATTCCG 198
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

GCTTCTTCTA TAGACAAAAA TATCATGGGT AAAATAATCA AGGCTATACC TAGAAGGAGG 60 

GACCAATCCA CTACTAATCC TAAGAACAAA ACACTCAAGA GAGCAGAAGA GAGAGGTTCA 120 

CTGGCACTGA TAACGGCAAC CACCAAAGGA GAAACCAAGG ACACAGCCTT CATGGAAATG 180 

AAAAAAGCAA AAGCCGTTCC AAAGAAAGCG ATAATGAGGC AAATCAAGAT ACTCCAAATA 240 

TCAAGAGTAA AGGAAAGCTG ATAAACCGGC GAGAGGACAT TGCTAAACAA ACCTGCCAAA 300 

ATCATCCCCC ACCCAACCGT AGGAACAAAA CCATAACGCT TAGCAAAAGG TTGGGGCAAG 360 

ATAACATTAA ACATAACACC CATGGCACTC AGCAAACCTG TTATAAGAGC TAGCGGCGTC 420 

ATGGATAACT GAGAGAGGTC TCCCTTTGTC GCCATCAAGC AAACACCCAG CATGGCAACC 480 

AAAACATAGA AAACAGCGCT TTTTGACGCT CGTTTTTGAT AAACCAAGCG ATTGTAAAAG 540 

AGGATAAAGA CAGGGCTAAT AAACTGTAAA ATAGTTGCTG TCGTAGCATT TGAGTATTCT 600 

ACACAGAGAT AGAAAAAATA CTGAACTGAA AAAATCCCCA AAATAGCATA GGCTAAAAAG 660 

GGCAGGTAAT TTTTCTTGTC TCGCCAAATA TCTAGCACTT GCGATTTTAA TTGTATTCCA 720 

GACCAAATGA GTACAAGACT CCCTGCCAGT GTCAAACGCA TAGAGGTAAT CCAGCCCGAA 780 

GACACCTGAT AATGAGTAAA GAAGTACTCT CCTAAAATTC CACAGATTCC CCATATTAAG 840 

CCGGATAGGA GCGAATAAAT TTTTCCGTTA ACAATCTTTT TCTGATACTG A 891 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 120 

TGGGCCAGAT ACTGTTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT 180 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG 300 
ATGCGTTTGA CTGTCAAATG GAGTG 325
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCCAATTTA AAATACAAAT ATGGCAATCG 60 

CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120 

TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180

AGTCAGTAGA TCCGTTTACT GCCGAAATAA ATAAGAGGAA GTAACGTnAA GTGCTTTAGC 240 

ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300 

AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 344 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 692 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

CCCTATCCCT GCTATTGGGG CTGCTCTCAT TGCTGCPTTG GCACAAATCA GTCTTCCAAT 60 

TGGACCTGTT CCCTTCACTC TGCAAAACTT TGCAATCGGC TTGATTCTAC TGTCTTTAGA 120 

CCGAGAGAGG CTGTACTTTC TGCTGGACTC TATCTTCTTC TAGGTGCTAT CGGTCTTCCT 180 

GTCTTTGCAG GAGGTGGAGC TGGTTTTCAG GCTTTAGTTC GCCCTACTGC AGGCTATCTT 240 

TGGTTTTATC TCGTTTACTC TGGACTTACT TCCTCTCTAA CCAACAGCAA GAGTGGTGTT 300 

GTTAAGATTT TTCTTGCAAA CCTCTTGGGT GATGCCCTTG TCTTTGTCGG CCGGATTCTC 360 

AGCTTCCATT TCCTAGCTGC AATGGCATTT GAAAAAGCTC TTGCTGTGGG C^TTCTTCCC 420 

TTTATCATTC CAGACCTTGG CAAACTTCTA GCTATTAGTT TTATTAGCCG TCCCCTACTT 480 

CAACCCCTTA AAAATCAGGC TTACTTTACT AACTAAAAAA GGATATCGAG TTATCATGAC 540 

TCAATATCCT TTTCTTTTAT TTTGAAAACT TATACTCAAT GAAAATCAAA GAGCAAACTA 600 

GGAAGCTAGC CGCAGGCTnG CAAAACACTG TTTTGAGGTT GTGGATGAAA CTGACGAGTA 660 
AnATCTCATA CATACGGCAA GGCAAAGCTG AC 692
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:



GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 60

GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 120

GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 180

GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 240

CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 300

TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 360

TTCACATGCT TTGGAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGGTCATT 420

CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 480

GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA AAGTGATTCA ACTATATCGC 540

TCATTAGAAC TTGATGGAAA GATTGGGATT ATTTTAAACT TGACACCTGC TTATCCAAGA 600

AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAACTC 660

TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 720

GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 780

GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGCACAAGC AAATCCTGAG 840

GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 900

CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAG CCATTTATGA TATTGCTATG 960

ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 1005



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

CCGACAAGCA ATATTAAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 60 

TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAAGAATG AAAATTAATA AAAAATATCT 120 

AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGTTTGTTCC TATGAGCTTG GACGTTACCA 180 

AGCTGGTCAG GATAAGAAAG AGTCTAATCG AGTTGCTTAT ATAGATGGTG ATCAGGCTGG 240

TCAAAAGGCA GAAAACTTGA CACCAGATGA AGTCAGTAAG AGGGAGGGGA TCAACGCCGA 300 

ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 360 

TTACTATAAT GGCAAGGTTC CTTATGATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 420 

TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 480 

TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA GCTCATGCGG ATAATATTCG 540 

GACAAAAGAA GAGATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 600 

AGATAATGCT GTTGCTGCAG CCAGAGCCCA AGGACGTTAT ACAACGGATG ATGGGTATAT 660 

CTTCAATGCA TCTGATATCA TTGAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 720 

CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGCTAGC GAGTTAGCTG CTGCAGAAGC 780 

CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCTAGTTATA ATGCAAATCC 840 

AGCTCAACCA AGATTGTCAG AGAACCACAA TCTGACTGTC ACTCCAACTT ATCATCAAAA 900 

TCAAGGGGGA AACATTTCAA GCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 960 

CATGTGGGAT CTG 973 
(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 60 

TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 120 

CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 180 

GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 240 

TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 300 

GCTGAAAGAG TTTCCTTGAT CGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 360 
GTCGCAAGGG CATTTTCCAC ATTGTGGCTA CCTGGAACAC CGATTTCATT CCCTGCCATG 420

ACTACTTCAC CACGGAAGTA GAGTTGACCA TCTTCCAGAT AAGCTCCATC AACCTTTTCA 480 

AGTGTTGAAA ATGGTACAAC AGTGGCTTCT GTCTTGGAAG TCAAGTCTTT TGCCAAGTCT 540 

TCATTAAAGT TCAAGACAAG GAAATCAGCT GCTGTCATCT TGTTCTGGAT ATTCCACTTG 600 

GCTGCTACAT ATTCCGAAAA TGACCCATGG TAGTCGATAT GAGTTGGCAT GAGGTTGGTA 660 

ATAACCGCAA TCTCTGGATG GAATTCTTGA ACACCCATGA GTTGGAAAGA AGAAAGTTCC 720 

ATAACAAGCG TGTCCTTATC TGATGCTATT TGAGCAACCT GACTAGCTGG ATAGCCGATA 780 

TTCCCTGATA AAAGACCATG TTGGCCAGCA GCAGTCAAAA CTTCCCCGGn TCCTCTAGAG 840 

TCG 843



(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

TTTTTTTTAT ATTTTTTTTA TTTATTATTT TTTGGCAAAA AAGACCAATT TGCTTTGGAG 60 

CATTGCTTCT GCATTAAATT GTCTATTTTT GCTCGTGCTG TTACGCTCTT TGTATCATGT 120 

ATTAACTAGC AAGTGCAACT TGCAAACTAC TAGTAAGAGG AGAAAAACAA AATGGTTATG 180 

ACTGACCCAA TCGCAGACTT CCTAACTCGT ATTCGTAATG CTAACCAAGC TAAACACGAA 240 

GTACTTGAAG TACCTGCATC AAACATCAAA AAAGGGATTG CTGAAATCCT TAAACGCGAA 300 

GGTTTTGTAA AAAACGTTGA AATCATTGAA GATGACAAAC AAGGCGTCAT CCGTGTATTT 360 

CTTAAATACG GACCAAATGG TGAGAAAGTT ATCACTAACT TGAAACGTGT TTCTAAACCA 420 

GGACTTCGTG TCTACAAAAA ACGTGAAGAC CTTCCAAAAG TTCTTAACGG ACTTGGAATT 480

GCCATCCTTT CAACTTCTGA AGGTTTGCTT ACTGATAAAG AAGCACGCCA AAAGAATGTT 540 

GGTGGTGAGG TTATCGCTTA CGTTTGGTAA AATCAAGATA CAAAGCTCGT AAAGAACAAA 600 

GCAAAATTAG GAAGTTGGAG AAGTTTGTTT ACAAACAGGC CAACTTATCT ATTTTGCACA 660 

GTTCTTAGAG CGTGTTCAGT TCAGCTCTTG AGCTAAGTAA GTATCTGAAC CCCGTGAAAA 720 

CTGGCCGTGC TGGCATGTTC GGGTAACAGG AGAnAATAAA CATGTCACGT ATTGGTAATA 780 

AGTTCAGCTA AGGCCTTCGT AAAAGTT 807 
(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 653 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGGATGATA CAGATCTGGA ATTGCCTCTC 60 

TTAACCATCT tTGAACAAAA TCCTCAGGCT CAGGTCACTA TTTTCGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GCCACCCTAT 180 

ATGCATCAAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240 

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 300 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 360 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 420 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTCCCAAT CTAGCTGGTC 480

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 

ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 600 

GCCAGTTAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA 653 

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 641 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 120 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CACGCGAGTT GACCAGTCAG GAAACAGATG 180 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 300 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 360 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 420 
TCAACCAAGT AGGAAACTTT GTTAAAAGTA ACTTGCTCAA CGAGTGGAAG GGTAAAATTG 480

CTACGGATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 540 

AGGTCGATAC TGTCAGCACA GTGATTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 600 

CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 641 



(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1958 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:



CCTCAAGGCC AATTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGTGACAGAA 60

TTTGCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CAGTTTTAGA TGGCTTTAAT 120

AGCGAAATTA TTGCTTTTAA TCTTTCGACT TCACCCAACT TAGAACAAGT ACAAACAATG 180

TTAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCTCCATAG TGACCAAGGC 240

TGGCAATATC AACACGATTC TTATCATCGG TTCCTAGAGA GTAAGGGAAT TCAAGCATCT 300

ATGTCACGCA AGGGCAACAG CCAAGACAAC GGTATGATGG AATCTTTCTT TGGCATTTTA 360

AAATCCGAAA TGTTTTATGG CTATGAGAAA ACATTTAAAT CACTTAACCA ATTGGAACAA 420

GCCATTATAG ACTATATTGA TTACTACAAC AACAAACGAA TTAACGTAAA ACTAAAAGGA 480

CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTGGATAAA TTAATTGTCT AACTTTTTGG 540

GGTCAGTACA AAACTCTTGC TACTATGCGT TTTATTATTG AAAGACTTAT TGGACTTTCT 600

CTCAAATCGA GTTTTTACTC AATTTTCTTA CTTGATTGGG ATTGAAATTC CAATTAATTT 660

CTCTGAGTAG AGTGTCTTGA TATTGGCTTC ATCAACAGAG GCCTTATCAA TTTTACGTTT 720

CAAGAAAAAT TCTTGAATGG TTTCGATTTC AGGCTCACGA ATAGCACGGT GTTTGTTTGA 780

GATGAGGATT TCATAGTGAA GCGGAGCTTG GGTAAAAATA ACATCTGTAT TCCCTGCAGA 840

ATAAACCTCA ACAAGGGTTG CATCGGTACT TTCTAGCTGA CTTTTTACAA GTTGCGAGTG 900

TGAGTTTGTC GTATTGATAA GCTTCATAAT ATTTCCTCCG ATTTTCTAAT TCTATTATAG 960

CACTTTrTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 1020

GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGTTT GAATTTTATT 1080

TTCGAAGAGT ATTAGCCAAT CTTATGCTGT TTTTTCCAAG ATTCAATGGC CCATTTATGG 1140
CTACCACGTT


TAAGGTTTTT 

™ ™ ^» V 4 4 4 4 


GATAGCCTCG 


TCAATAGGGA 


ACCACfiCAAT 


a I \»r\ l l nnm 


.1 A Uv 


TTTTCTAGTG 


GC1TTTGTAC 


TTCTTTGAAA 

4 4 X» 4 4 4 * M M 4 


GCAfiTTfirTT 


CATACAGGTA 




1 9 fin 


TAGTAGTAGG 


TATCACGATG 


ACGAGAATAG 


nllA A n A 1 V* A 








CCAATTTCTG 


CTGTCAAACC 


AAGCTCTTCA 


ATCAACTCAT 


A A * JMMJ>JW 


TTPPTftAIYIA 




TTTTCACCTG 


CTTCAATTTC 


TCCACATGGT 


AGGAACCAAG 


CACCATTTfif! 

V*/\V>\_*\ All VJVJ 


I 1 \» i 1 WW- A 




AGAACAATTT 


\m 4 4 4 4 4 A & w 




#w /w\v» 1uv.a i 






1 CAA 


TAGTCTGTAT tcactttttt TCTCCGAAAG TTGGGTTTGC CATTGCATTT TCCTCATTAT 1560

CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 1620

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTCTC TCTTATTGTG AGCGAACAGG 1680

TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 1740

AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 1800

GGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCTT ATTTCACTGA 1860

AATAGCTTCT GAATTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 1920

tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 1958



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 60 

CTTAGAGAAG CTTTCAAATA AATTTGGGAT AmACAATTCT AATCTTAGGT ATATGATTAA 120 

ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATOGTT ACTATTCTCC 180 

TGATTTAAAA CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 240 

TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 300 

GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 360 

CCATCCTAAA AAAGTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 420 

AGAAATTGTT TAAGAATTAA TGACTGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 480 

ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 540 

AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 600 
TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 660

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 

GCTTTACTTA T 851
(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:

GGCTAGAATC GATATCTCCA ATGAGTTGGT tTAGCTGGTG AAACTGTAAA AAGATTTCGw 60 

CCAATTCAAG GTTGAGGCAT CGCAAACTAT GGACTGTTTC CTCGTCAGTT CTGGAAAGAA 120 

AACGGGATAA GGTTGGCTGT GAAGCAAGCT GCCCTCCTTC CAACAATTTT GGAAAGTAGG 180 

CATCAGCTGA CAATTCTTTA CAAGCATAGT CCGTTCCATA ACCTGTTAAC AGTTGAAAGA 240 

GGAACTGGAC AAGGATATCT GAATCCGAAT AACGACAGTA GCGGCGTTGG TCATTCGTTA 300 

CTAAATACTT AGAAATCCGC TCTTTTAGTT TCAACTGGGA AAAAAGTTCC TGAAAAAAGA 360 

TAAGACCACC ATACTGGGTT AAATGACCTC CATCGAAAGA TAGTTGGTAA AAACACTTGT 420 

TTTGGAAGTG ATGATTTGGT AAACTGTTCA TGTGAGTTTC CTTTCTTTTT GTGTTTTTTT 480 

CTACACTTAT ACCATAAAGG GGAAACTCTP TTTTGTCTAG TAAAAAACAC CCATTGGGTG 540 

AAAAAAGAAA CCATCCAGGA TCTAAGCTAA GGCAAGGATT CTGGATGGTT TTTAGATTTG 600 

GGGTGAATAA TTGGGGTTTT AGCTGCTTGC GGCCAATCAG GTTCAGATAC AAAAACTTAC 660 

TCATCAACCT TTAGTGGAAA TCCAACTACA TTTAACTATC TATTAGACTA TTACGCTGAT 720 

AATATAGTCA ATTGAAACAA GAACAAGACA AAAGAGCCTC ATAAAAGGTA TTGCAACTTG 780 

GTAATACCTT TTTGAGGTGC TTTTTGATAT GAGCCCATGT TTTCTCAATA GGATTGTACT 840 

CAGGTGAGTA GGGAGGAAGA GGTAAAAGTT TATACCCAAA CTCTTCACAC AAGAGTTCTA 900 

ACTTACCCAT TCTATGGAAT CTTGCATTAT CCATAATAAT AACCGATGGT GTGTTTAATG 960 

TTGGTAAGAG AAATTTCTGA AACCAAGCTT CAAAAAAGTC GCTCGTCATC GTCTCTTCGT 1020 

AAGTTATTGG AGCGATTAAC TCACCATTTG TTAGACCTGC AACCAAAGAA ATCCTCTGAT 1080 
ATCTTCTTCC AGATACTTTG CCTCTTCTTA ACTGACCTTT TAATGAGCGA CCATATTCTC 1140

GATAAAAATA AGTATCGAAT CCTGTTTC 1168 
(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

GTCAGCTTCA GCAAGCCCAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC 60 

GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCC GCTTCAGCAT CAACGAGTGC 120 

GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 180 

TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 240

GAGTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 300 

ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 360 

CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 420 

ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 480 

ATCGGCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 540 

AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 600 

CTCGGCTTCA GCAAGCACAT CAGCATCTGA ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 660 

AAGTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 720 

ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CAGAGTCAGC 780 

AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 840 

CTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 900 

AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 960 

CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACGAGTGCGT CTGAATCGGC 1020 

ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 1080 

ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 1140 

GTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 1200

GTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 1260 

ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 1320 
TCrGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCkTC AGCCTCAGCG 1380

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 1440 

TCAGCAAGTA CTAGCCCCTC AGCCTCAGCA TCAACGACTG CGTCCGCTTC ACCAAGTACT 1500 

AGTGCATCAG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCGACAAG CGCCTCAGCT 1560 

TCAGCAAGTA CCACTGCGTC AGCCTCAGCG TCGACAAGTG CGTCCGCTTC AGCAAGTACC 1620 

TCAGCGTCTC AATCAGCATC AACAAGTGCG TCGGCTTCAG CATCAACGAG TGCATCAGCT 1680 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 1740 

AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCCG CATCAACAAG TGCCTCGGCT 1800 

TCAGCAAGCA CCAGTGCTTC GGCTTCAGCG TCAACGACTG CGTCTGAGTC AGCATCAACG 1860 

AGTGCGTCAC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCACTTC 1920 

CGCATCAACA AGCGCCTCGG CCTCAGCAAG TACAAGTGCT TCAGCCTCAG CATCAACCAG 1980 

TGCATCAGCT TCAGCCTCAA CAAGTGCTTC AGCCTCAGCG TCAACCAGTG CCTCGGCTTC 2040 

AGCAAGTACC AGTGCGTCAG CTTCAGCAAG CACAAGTGCG TCAGCTTCAG CATCAACCAG 2100 

TCCTTCGGCT TCGGCATCAA CAAGTGCCTC AGCATCAGCA TCAACGACTG CGTCAsCTCA 2160 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACCAGTGCAT CAGCCTCAGC AAGTATCTCA 2220 

GCGTCTGAAT CGGCATCAAC GAGTGCATCA GCATCAGCAT CAACGACTGC ATCCGCTTCA 2280 

GCGTCAACCA GTGCATCAGT CTCAGCAAGC ACCAGTGCGT CGCCTTCAGC ATCAACCAGT 2340 

GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT CAACGAGTGC GTCAGcCTCA 2400 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACGAGTGCAT CGGCTTCAGC AAGTACCAGC 2460 

GCCTCAGCTT CAGCAAGCAC CAGTGCGTCA GCCTCAGCAA GTACCAGCGC CTCAGCCTCA 2520 

GCAAGCACCA GTGCCTCAGC TTCAGCAAGT ACCAGTGCGT CAGCCTCAGC GTCGACAAGT 2580 

CCGTCGGCTT CAGCAAGTAC CTCAGCGTCT GAATCAGCAT CAACGAGTGC ATCACCTTCA 2640 

GCATCAACAA GTGCTTCAGC TTCAGCAAGT ACCAGTGCGT CGGCTTCAGC ATCAACGAGT 2700 

GCTTCAGTCT CAGCGTCAAC CAGTGCCTCT GAATCAGCAT CAACAAGTGC CTCGGCTTCA 2760 

GCAAGCACCA GTGCGTCGGC TTCAGCAAGT ACTAGTGCAT CGGCTTCAGC ATCGACAAGT 2820 

GCGTCTGAAT CGGCATCAAC GAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCAGCCTCA 2880 

GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC GTCAACCAGT 2940 

GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCGGCCTCA 3000 

GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCCGC TTCAGCAAGC 3060 
ACAAGTCCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCG 3120

GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT CAACCAGTGC GTCGGCTTCA 3180 

GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGACT 3240 

GCGTCTGAGT CAGCATCAAC GAGTACGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCG 3300 

GCATCAACCA CTGCCTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT 3360 

GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG GCCTCAACCA GTGCATCPGA ATCGGCATCA 3420 

ACCAGTGCGT CAGCCTCAGC AAGTACTAGT GCATCAGCTT CAGCATCAAC GAGTGCATCG 3480 

GCTTCAGCAT CAACCAGTGC CTCGGCTTCA GCGTCAACCA GTGCGTCAGC TTCAGCAAGT 3540 

ACCAGTGCTT CAGTCTCAGC ATCAACAAGT GCTTCAGCCT CAGCATCGAC AAGTGCCTCG 3600 

GCTTCAGCAA GCACATCAGC ATCTGAATCA GCGTCGACAA GCGCCTCAGC TTCAGCAAGT 3660 

ACCAGTGCGT CAGCCTCAGC GTCGACAAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA 3720 

GCTTCAGCAT CAACGAGTGC ATCGGCTTCG GCGTCAACCA GTGCATCAGA GTCAGCAAGT 3780 

ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT CAGCAAGCAC CAGTGCGTCG 3840 

GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA GTGCGTCAGC CTCAGCAAGT 3900 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT CCGTCCGCTT CAGCAAGTAC TAGCGCCTCA 3960 

GCCTCAGCGT CAACAAGTGC ATCGCCTTCA GCCTCAACCA GTGCGTCTGA ATCGGCATCA 4020 

ACCAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCACCCT CAGCGTCAAC AAGTGCATCG 4080 

GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCGTCA 4140 

ACAAGTGCAT CGGGTTCAGC GTCAACGAGT GCGTCTGAGT CAGCATCAAC GACTCCGTCA 4200 

CCTCAkCAAG CACATCAGCT TCTGAATCTG CATCAACCAG TGCCTCACTT CCGCATCAAC 4260 

AAGCCCCTCG GCCTCAGCAA GTACAACTGC TTCAGCCTCA GCATCAACCA GTGCATCAGC 4320 

TTCAGCCTCA ACAAGTGCTT CAGCCTCAGC GTCAGACCAG TGCCTCGGCT TCAGCAAGTA 4380 

CCAGTGCGTC ACTTCAGCAA GCACAAGTGC GTCAGCTTCA GCATCAACCA GTGCTTCGt.r 4440 

TTCGGCATCA ACAAGTGCCT CAGCATCAGC ATCAACGAGT GCG 4483 

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:




GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCGGCC TCAGCAAGCG 120 

CAAGTACCTC AGCGTCACTT CCGCCTCAAC CAGTGOGTCG GCTTCAGCAA GCACAAGTGC 180 

GTCAsCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

TCAGCCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCTPC AGCCTCAGCG 360

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 480 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

TCAGCATCAA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACCAGTG CGTCAGCCTC AGCAAGTACT ACTGCATCAG CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTCCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGGTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 1260 

AGTACTAGCG CCTCAGCATC AGCGTCAACA AGTGCTTCGG CTTCAGCGTC AACGAGTGCG 1320 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 1380 

TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTOGCCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCCGCATCAA CAAGTCCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

AGCGTCAACC AGTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACCAG 1740 



CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC TGAATCGGCC TCAACCAGCG CCTCGGCCTC 1800

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 1860

TGCTTCCCCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 1920 

AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 1980 

AGCCTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC 2040 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGTGCG TCAGCCTCAG CATCGACAAG 2100 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC 2160 

AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 2220 

GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 2280 

TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 2340 

GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT ACAGCAAGTA CTAGTGCATC 2460

AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 2520 

TACCAGTGCG TCAGTTCACG CATCAACAAG 2550
(2) INFORMATION FOR SEQ ID NO: 365:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TGCGTCCGCT TCAGCATCAA 60

CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCCTCAC 120

CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 180

CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCACCC 240

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 300

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 360

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 420

AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 480

TCAGCCTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540

TCACCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 600

TCAGCAAGTA CCAGTGCGTC AgCCTCAGCA AGTACCAGTG CTTCAGCCTC AGCGTCGACA 660

AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA 720

AGTACTAGCG CCTCAGCCTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGTGCA 780

TCAGCTTCAG CAAGTACTAG CGCCTCAGCC TCAGCGTCGA CAAGCGCCTC AGCTTCAGCA 840

AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG TACCTCAGCG 900

TCTGAATCAG CATCAACAAG TGCGTCGGCT TCAGCATCAA CGAGTGCATC AGCTTCAGCA 960

TCAACAAGTG CTTCAGCTTC AGCAAGTACC AGTGCGTCGG CTTCAGCATC AACGAGTGCT 1020

TCAGTCTCAG CGTCAACCAG TGCCTCTGAA TCCGCATCAA CAAGTGCCTC GGCTTCAGCA 1080

AGCACCAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTGCG 1140

TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCTTCCGCA 1200

TCAACAAGCG CCTCGGCCTC AGCAAGTACA AGTGCTTCAG CCTCAGCATC AACCAGTGCA 1260

TCAGCTTCAG CCTCAACAAG TGCTTCAGCC TCAGCCTCAA CCAGTGCCTC GGCTTCAGCA 1320

AGTACCAGTG CGTCAGCTTC AGCAAGCACA AGTGCGTCAG CTTCAGCATC AACCAGTGCT 1380

TCGGCTTCGG CATCAACAAG TGCCTCAGCA TCAGCATCAA CCAGTGCGTC AGCCGG 1436



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 



GCAGTTGCCA CACCGTGCTG ACCAGCACCC GTTCCTGCGA TAATTTTCTT TTTACCCATG 60 

CGTwTGGCAA GCCAAACTTG TCCTAAGGCA TTGTTAATCT TGTGGGCTCC TGTATGGTTA 120 

AGGTCTTCCC GTTTGAGATA AATCTTGCTC CGCCAATATG CTGGGTCAAG TTTTTTGCGT 180 

AATAAAGAGG AGTTTCACGT CCTACGTACT GGCGCAAAAG CTGGTTTAAT TCCTCTTGGA 240 

AACTTGGGTC TGCCTGACTT TCACGGTAGG CCTTCTCCAA CTCCAAAACT GCTGTCATCA 300 

ATGTTTCTGG GACAAAACGT CCGCCGAATT TTCCGTAAAA TCCATCTTTA TTTGGTTCCT 360 

GATATGCCAT GCTTTACCCT CTCTATAAAT CTTCTAATCT TTTCATGATC TTTTTGTCCA 420 

TCTGTCTCCA CTCCGCTCGA TACATCTACT GCATAGGGAG TAAAGTGTTG AATTGCTTTT 480 

ACTACATTAT CTTCATTAAG GCCACCTGCG ATAAAGAAGG GCTGTGCTAG TCCAGTCGTA 540 

TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTACA 600 

TAATCTGCCT GAGAATTGCG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 660 

GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 720 

AAGCCGGGGA TCCTC 735 

(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

TACTAGCGCC TCAGCCTCAG CGTCAACAAG TGCATCGGCT TCAGCATCAA CGAGTGCGTC 60 

CGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCATCGG CTTCAGCGTC 120 

AACGAGTGCG TCTGAGTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 180 

TGAATCTGCA TCAACCAGTG CCTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG 240 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC 300 

AGCCTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CCTCAACCAG 360 

TGCATCTGAA TCGCCATCAA CCAGTGCGTC AGCCTCAGCA AGTACTAGTG CATCAGCTTC 420 

AGCATCAACG AGTGCATCGG CTTCAGCATC AACCAGTGCA TCAGAGTCAG CAAGTACCAG 480 

TGCGTCAgCT TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 540 

AGCGTCAACA AGTGCTTCAG CTTCCGCGTC AACCAGCGCC TCGGCCTCAG CAAGTATCTC 600 

AGCGTCTGAA TCGGCATCAA CAAGTGCCTC GGCTTCAGCA TCAACGAGTG CATCAGTCTC 660 

AGCAAGCACC AGTGCGTCGG CCTCAGCAAG CACCAGCCCG TCTGAATCCG CATCAACCAG 720 

TGCCTCAGCT TCAGCAAGTA CCTCAGCATC TGAATCAGCA TCAACAAGTG CATCGGCTTC 780 

AGCAAGCACA AGTGCTTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCCG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTACTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAC CTTCACCAAG 1260 

CACCACTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCACCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCCGCT TCAGCAAGCA CCAGTGCCTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CG 1702 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

ACCAGTGCAT CAGCTTCAGC CTCAACAAGT GCTTCAGCCT CAGCGTCAAC CAGTGCCTCG 60 

GCTTCAGCAA GTACCAGTGC GTCACTTCAG CAAGCACAAC TGCGTCACTT CAGCATCAAC 120 

CAGTGCTTCG GCTTCGGCAT CAACAACTGC CTCAGCATCA GCATCAACGA GTGCGTCACC 180 

TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACCAGTG CATCAGCCTC AGCAAGTATC 240 

TCAGCGTCTG AATCGGCATC AACGAGTGCA TCAGCATCAG CATCAACGAG TGCATCGGCT 300 

TCAGCGTCAA CCAGTGCATC AGTCTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 360 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC GGCTTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACC 480 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 540 

TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC 600 

AGCGCCTCAG CTTCAGCAAG CACCACTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC 660 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 720 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 780 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 840 

AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 900 
TCAGCAAGCA CCAGTGCGTC GGCTTCAGCA AGTACTAGTG C 941 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

CAGCAAGTAC TAGTGCATCA GCTTCAGCAT r AAmArrror* GCGTCAACCA 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 120 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCCTCCGCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA 300 

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 360 

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 420 

GCCCCTCAGC CTCAGCGTCA ACAAGTGCAT CCGCTTCAGC GTCAACGAGT GCGTCTGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT CCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCCTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 720 

CAACCAGTGC GTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGAGTGCAT 780 

CGGCTTCAGC ATCAACCAGT GCATCAGAGT CAGCAAGTAC CAGTGCGTCA GnTTCCGCAT 840 

GCAACAAGTG CCTCGGCTTC AGCAAGTAC 869 


(2) INFORMATION FOR SEQ ID NO: 370: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 60 

TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 120 

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGC CTTCAGCGTC AACCAGTGCA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 240 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA 360 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 420 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 480 

AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 540 

TCACCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 



(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 



CCGGAAAACA GCTCTGGCGC TTGGTCTTGC CCAGCGTATT GCTAGTGGTG ACGTGCCTGC 60 

GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGTCGTTG CAGGGACACG 120 

CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 180 

CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGGTA GCGGGATTGA 240 

TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 300 

GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGCCACTTTC 360 

TCGTCGTTTC GCTAAAGTGA CGATTGAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 420 

ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 480 

TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 540 

TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC 600 

AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 660 

ACCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGACTTGG TCACAGAGTC 720 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTATGTTCCT AGGGCCTACA GCTGTCGGGG TATCCGA 957 

(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

CAAAGCGCCT CAGCTTCAGC ATCAACAAGT GCGTCGGCTT CAGCATCAAC CAGTGCCTCG 60 

GCTTCAGCGT CAACCAGTGC GTCACATTCA GCAAGTACCA GTGCTTCAGT CTCAGCATCA 120 

ACAAGTGCTT CAGCCTCAGC ATCGACAAGT GCCTCGGCTT CAGCAAGCAC ATCAGCATCT 180 

GAATCAGCGT CAACCAGTGC TTCGGCTTCA GCAAGTACCA GTGCTTCAGC TTCAGCATCA 240 

ACCAGCGCCT CGGCCTCAGC AAGCACCTCA GCTTCTGAAT CGGCCTCAAC CAGCGCCTCG 300 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GCGCCTCAGC CTCAGCATCA 360 

ACGAGTGCTT CGGCTTCAGC AAGCACAAGC GCCTCGGGTT CAGCATCAAC GAGTACGTCA 420 

GCTTCAGCGT CAACCAGTGC TTCAGCCTCA GCATCAACAA GTGCGTCAGC CTCAGCAAGT 480 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTACGTCA 540 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GTGCGTCAGC CTCAGCATCG 600 

ACAAGCGCCT CAGCTTCAGC AACTACCAGT GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG 660 

GCCTCAACCA GTGCATCTGA ATCGGCATCA ACCAGTGCGT CAGCCTCAGC AAGTACTAGT 720 

GCATCGGCTT CAGCATCAAC CAGTGCCTCG GCTTCAGCGT CAACCAGTGC GTCAGCTTCA 780 

GCAAGTACCA TGTGCTTCAT GTCTCAG 807 
(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



CATCGGCTTC AGCATCAACG AGTGCGTCCC CTTCAGCAAG TACTACCGCC TCAGCCTCAG 60 

CGTCAACAAG TGCATCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTG 120 

CGTCACCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CACCTCAGCA 180 

TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG ACAAGTGCGT 240 

CGGCTTCAGC AAGTACCAGT GCGTCASCTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG 300 

ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT CAACCAGTGC GTCACCTCAG 360 

CAAGTACTAG TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCAGCA TCAACCAGTG 420 

CATCAGAGTC AGCAAGTACC AGTGCGTCAG CTTCCGCATC AACAAGTGCC TCGGCTTCAG 480 

CAAGTACTAG CGCCTCAGCC TCAGCGTCAA CAAGTGCTTC AGCTTCCGCG TCAACCAGCG 540 

CCTCGGCCTC AGCAAGTATC TCAGCGTCTG AATCGGCATC AACAAGTGCC TCGGCTTCAG 600 

CATCAACGAG TGCATCAGTC TCAGCAAGCA CCAGTGCGTC GGCCTCAGCA AGCACCAGCG 660 

CGTCTGAATC CGCATCAACC AGTGCCTCAG CTTCAGCAAG TACCTCAGCA TCTGAATCAG 720 

CATCAACAAG TGCATCGGCT TCAGCAAGCA CAAGTGCTTC AGCCTCAGCA AGTATCTCAG 780 

CGTCTGAATC GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTACCGCC TCAGCATCAG 840 

CGTCAACAAG TGCTTCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA 900 

CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG 960 

CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG 1020 

CTTCAGCCTC AGCGTCGACA AGTGCGTCGG GCTCAACCAG TGCATCTG 1068 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 240 

CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 300 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 420 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA 620 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

GTATTGGGGC GCCCCAACCT CTATGTGACT ACGGATTATT TCCTAGATTA CATGgGGATA 60 

AACCATTTAG AAGAATTACC AGTGATTGAT GAGCTTGAGA TTCAAGCCCA AGAAAGCCAA 120 

TTATTTGGTG AAAGGATAGA AGAAGATGAG AATCAATAAG TATATTGCCC ACGCAGGTGT 180 

GGCCAGTAGG AGAAAAGCAG AAGAGCTGAT TAAGCAAGGC TTGGTGACGG TTAACGGCCA 240 

AGTGGTGCGT GAACTAGCAA CCACTATCAA GTCAGGCGAC AAGGTCGAAG TTGAAGGTCA 300 

ACCTATCTAC AACGAAGAAA AGGTCTACTA TCTGCTTAAC AAACCACGCG GTGTGATTTC 360 

CAGTGTGACA GATGATAAGG GTCGCAACAC GGTTGTCGAC CTCTTGCCCA ATGTCAAAGA 420 

GCGTATTTAC CCTGTGGGTC GTTTGGACTG GGATACATCA GGTGTCTTGA TTTTGACCAA 480 

TGATGGGGAC TTTACAGACG AGATGATTCA CCCTCGTAAT GAGATTGACA AGGTTTATGT 540 

CGCGCGTGTT AAAGGTGTGG CCAATAAGGA CAATCTCCGC CCCTTGACCC GTGGTCTTGA 600 

GATTGATGGT AAGAAAACCA AGCCATAATA TATAGGTTTT GTAGCCTCTA CACCATAAAT 660 

ATTTGCTAAT AAAAATACTG TATTATTACC CTCTTAAGGT GCGAAATTAT TCAAGTTCTT 720 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

CGCCATTTCC CATCGTACCG CCGAAAATCC CACCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACTskATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 300 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGCAC GAAGAAACGA AACAATGGCA 360 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 480 

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 540 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 648 

(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

GTGCATCGCT TTCAGCATCG ACAAGTGCGT CTGAATCGGC ATCAACGAGT GCTTCGGCTT 60 

CAGCATCAAC GAGTGCGTCA GCTTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 120 

GTGCGTCCGC TTCAGCGTCA ACCAGTGCGT CGGCTTCAGC GTCGACAAGT GCTTCGGCTT 180 

CAGCATCAAC GAGTGCGTCG GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 240 

CAACCAGTGC GTCCGCTTCA GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 300 

CTGAATCGGC ATCAACGAGT GCGTCGGCCT CAGCAAGCGC AAGTACCTCA GCGTCAGCTT 360 

CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC GTCAGCCTCA GCAAGTATCT 420 

CAGCGTCTGA ATCGGCATCA ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT 480 

CAGCAAGCAC ATCAGCTTCT GAATCGGCAT CAACCAGTGC GTCAGCCTCA GCATCGACAA 540 

GCGCCTCAGC TTCAGCAAGT ACCAGTGCTT CAGCCTCAGC GTCGACAAGT GCGTCGGCCT 600 

CAACCAGTGC ATCTGAATCG GCATCAACCA GTGCGTCAGC CTCAGCAAGT ACTAGTGCAT 660 

CAGCTTCAGC ATCAACGAGT GCATCGGCTT 690 

(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



CGAGATTCTC TGGAGTTATG GATGTCGTTC CAATATGTGC ACGTTGGAAT GTTAGTGCTT 60 

ATATGGGGGG AACAGAATCC TCTCTTGATT GAAGACAAGC TAGTCATTAG GCTGGTTTGT 120 

CTTTTTGTCA ACTGTAGTGG GTTGATATAA TAGTATTAGT GAGTGGGATA AAAGTTTCAT 180 

TTAGTTTATT CAGTACAAAT TTAACGGGTC AAGATTTATA TACTAGTGGT GTTTTTGGGG 240 

CTGAGAGAAG TATCTTGATT TTATGTGTGG TTTTTATACT TACAGTTGTT CTGCTCCAAA 300 

GACCTTGTAG AGAAGAATTA GCTCATAAAG GAGATTGATT ATTTTGATAT CAAAAAAATG 360 

CACAGGATAA CCTGATGCAT TTTTTTAGCG ACAATGCTTG CTACTTCCTT CTGTCGAATT 420 

TAGACAATTT TAAACCCCAA TTATTCACCC CAAATCTAAA AACCATCCAG AATCCTTGCC 480 

TTAGCTTAGA TCCTGGATGG TTTCTTTTTT CACCCAATGG GTGTTTTTTA CTAGACAAAA 540 

AAGAGTTTCC CCTTTATGGT ATAAGTCTAG AAAAAAACAC AAAAAGAAAG GAAACTCACA 600 

TGAACAGTTT ACCAAATCAT CACTTCCAAA ACAAGTCTTT TTACCAACTA TCTTTCGATG 660 

GAGGTCATTT AACCCAGTAT GGTGGTCTTA TCTTTTTTCA GGAACTTTTT TCCCAGTTGA 720 

AACTAAAAGA GCGGATTTCT AAGTATTTAG TAACGAATGA CCAACGCCGC TACTGTCGTT 780 

ATTCGGATTC AGATATCCTT GTCCAGTTCC TCTTTCAACT GTTAACAGGT TATGGAACGG 840 

ACTATGCTTG TAAAGAATTG TCAGCTGATG CCTACTTTCC AAAATTATTG GAAGGAGGGC 900 

AGCTTGCTTC ACAGCCAACC TTATCCCGTT TTCTTTCCAG AACTGACGAG GAAACAGTCC 960 

ATAGTTTGCG ATGCCTCAAC CTTGAATTGG TCGAATTCTT TTT 1003 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTGGGAA TTTTGAGGTA GATCTATGAT 60 

TGAAATACTA ATTGTTTTAG CTATTATCCT ATCTCTTCCT TTGATTGTAT TGGTAACTAT 120 

ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 180 

CTACTGGCAG AGCAACACCT TGGTCAAGGT CCTCACTTTA TTGGTGAGTT TGGCTTTATT 240 

TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 300 

CCTTTTGTGG ATTGGTCTGA AGTTTTCTTT TTTATACTCA ATGAAAATCA AAGAGCAAAC 360 

TAGGAAGCTA GCCGCAckGC TCAAAACACC GTTTTGAGGT TGTAGATATA ACTGACGAGc 420 

GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 480 

TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 540 

CTGACGAAGT CGcTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 600 

ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGACG 660 

CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGGT AAAAATCAAC TTTTACTTGG 720 

ATGAAGGTTT TTTTTTTT 738 
(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 240 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAACGC GCTTTCTTGG 360 

CTTCACCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 

GAAGGGCPGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 480 

CTTCCAAACC TTGCATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 540 

TAGCAATACC AGCTACTGGC GCCTTGATTC GCACACCACC AGCCATAAGG GCAAGACTTC 600 

CCCCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT CCTACTAGAC 660 
GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 695 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA GTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAACCATC 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGTGTTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 360 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 420 

TTTTCCCAGT TGAAACTAAA AGAGCGGATT TCTAAGTATT TAGTAACGAA TGACCAACGC 480 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 540 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 660 

GAGGAAACAG TCCATAGTTT GCGATGCCTC A 691 



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT GTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 240 

TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 300 

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG GACTAGAATG 360 

CTTTTTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 420 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACACGATAGT CAAATCGATT TCTAACAATG 480 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC CTTTTCTGTA 600 

CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CGTTTATATA ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 720 

GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 750 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



TCAAATTCTT CGTGGTCCGC ATATCTnTCT TCGTACACGG CAGTCACTTG GTCTTTCACT 60 

ACTCGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120 

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180 

TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT GGAAGGCAAT CAATTCTTTG 240 

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300 

CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA CTGTCAATTC AAGAAGAGAT 360 

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420 

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 480 

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540 

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600 

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 660 

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720 

GCCATTTTCT GGGGATCC 738 



(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60 

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 240 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGCCT 360 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 480 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 

CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600 

TTAAATGGTA GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 657 



(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAG TCTCAGCAAG 60 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 120 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAAGTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACCAGT GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCAGCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 420 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 

TCACCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 540 
AGTGCGTCAC CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 586 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

CGTCGGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 60 

CAAGTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG 120 

CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 180 

CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 240 

TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAAGCGCCT CGGCCTCAGC 300 

AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 360 

TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTcAGCA 420 

AGCACAAGTG CGTCAATTTA GCATCAACCA G 451 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTGAA TCCGCATCAA 60 

CCAGTGCTTC AGCTTCAGCC AAGTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 

GCTTCAGCGT CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCGTCG 420 

ACAAG 425 

(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTAGT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA inTJTCTT TT TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 360 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC kAGTkTGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGkCTCCA AAAAAkAGCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAACAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 

CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 

CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 360 

GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 

CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA 480 
GCGCCTCAGC TTCAGCAAGC ACCGG 505 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 120 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT CTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA ACAGTATATC ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 300 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 360 

AGTGTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT GACGAAGTCG GTACCGA 447 
(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

AGCACTTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTTCTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 120 

AGCTATAATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 300 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAGAGCTAAA 360 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GCCTGCTAAA ATATTTCTAT 420 

AAATTAATTT GACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATATAAGT 480 

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in Annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for the International 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be made 
available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS: 1-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not randomly selected. 

7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 
degenerate variant thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5' to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 

14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalently attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 
to any one of the fragments of the Streptococcus pneumoniae genome depicted in 
SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a homolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 
comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

b) identifying members of said library which contain sequences that 
hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step (b). 

16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 
comprising steps of: 

a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b). 

17. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 
and 3 or by a degenerate variant of said fragments. 

18. An isolated polynucleotide molecule encoding any one of the 
polypeptides of claim 17. 

19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 
pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 and 3, under 
conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 
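
The data storage, search, and retrieval elements recited in claim 5, together with the 95% identity threshold of claims 1 and 5, can be illustrated with a minimal sketch. The claims do not specify a comparison algorithm, so the ungapped sliding-window comparison below is only an assumption made for illustration. The function names (percent_identity, search, retrieve), the in-memory dictionary used as the data storage means, and the toy sequences are hypothetical and are not taken from the sequence listing.

```python
from typing import Dict, List, Tuple

Hit = Tuple[str, int, float]  # (sequence id, 0-based offset, identity)


def percent_identity(a: str, b: str) -> float:
    """Fraction of matching positions between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(a.upper(), b.upper()) if x == y)
    return matches / len(a)


def search(database: Dict[str, str], target: str,
           threshold: float = 0.95) -> List[Hit]:
    """Search step (assumed ungapped): slide the target along every stored
    sequence and keep windows whose identity is at or above the threshold."""
    hits: List[Hit] = []
    n = len(target)
    for seq_id, seq in database.items():
        for start in range(len(seq) - n + 1):
            identity = percent_identity(seq[start:start + n], target)
            if identity >= threshold:
                hits.append((seq_id, start, identity))
    return hits


def retrieve(database: Dict[str, str], hits: List[Hit],
             length: int) -> List[str]:
    """Retrieval step: return the stored subsequence for each hit."""
    return [database[seq_id][start:start + length]
            for seq_id, start, _ in hits]


# Toy data (hypothetical, not from SEQ ID NOS: 1-391).
toy_database = {"toy_1": "TTTT" + "ACGT" * 5 + "GG"}
toy_target = "ACGT" * 4 + "ACGA"  # differs from the stored run at one base
toy_hits = search(toy_database, toy_target)
toy_sequences = retrieve(toy_database, toy_hits, len(toy_target))
```

A practical system would more likely use a gapped alignment tool and an indexed store; the linear scan here is chosen only to keep the three claimed elements visible in a few lines.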